INDEX
Explanations
phrases starting with a comma
occurrences of the word "but."
Negative Logits
interstitial  -0.74
ãĤ¶           -0.65
IU            -0.65
ORD           -0.63
ļéĨĴ          -0.63
olves         -0.62
Ô             -0.61
ords          -0.61
coverage      -0.58
APH           -0.58
Positive Logits
alas       1.17
uh         0.90
yeah       0.87
secondly   0.79
needless   0.76
moreover   0.76
yes        0.75
according  0.73
lest       0.73
unlike     0.73
Activations Density 0.121%