INDEX
Explanations
proper names related to various topics or entities
references to "The" followed by a noun or proper noun
New Auto-Interp
Negative Logits
beware
-0.84
aloud
-0.80
ionics
-0.78
partake
-0.74
forcefully
-0.69
stopping
-0.68
patiently
-0.68
thood
-0.67
abound
-0.66
recite
-0.66
POSITIVE LOGITS
Hague
1.18
odor
1.16
atre
1.14
Simpsons
1.11
orem
1.11
oret
1.09
Economist
1.05
Greatest
1.01
aters
1.00
Beatles
1.00
Activations Density 0.104%