INDEX
Explanations
proper nouns or names of individuals
instances of the word "took."
Negative Logits
覚醒          -0.70
cape          -0.69
constitu      -0.63
Cong          -0.61
Smile         -0.61
lex           -0.61
gom           -0.60
rehens        -0.59
lite          -0.59
idding        -0.59
Positive Logits
advantage     1.18
aback         1.16
pains         1.09
refuge        1.03
aways         0.91
exception     0.88
heed          0.86
precautions   0.85
aim           0.83
liberties     0.82
Activations Density 0.078%