INDEX
Explanations
phrases indicating evidence or examples supporting a stated proposition
instances of the word "Such" indicating a pattern or emphasis in the text
Negative Logits
oil      -0.69
kick     -0.68
osph     -0.64
Ö¼       -0.64
Polo     -0.63
creen    -0.63
olate    -0.62
Loaded   -0.61
office   -0.61
iliate   -0.61
Positive Logits
ities          0.81
matters        0.79
ties           0.79
embodiments    0.67
discrepancies  0.67
cond           0.64
cancell        0.63
phenomena      0.63
Takeru         0.63
fter           0.63
Activations Density: 0.035%