INDEX
Explanations
mentions of the name "John."
New Auto-Interp
Negative Logits
apiro
-0.18
erti
-0.17
SpinBox
-0.16
/stdc
-0.15
ivol
-0.14
ietet
-0.14
eshire
-0.14
ела
-0.14
خرد
-0.14
IRM
-0.14
POSITIVE LOGITS
Lith
0.19
Trav
0.19
Derek
0.19
leg
0.18
fav
0.18
Fav
0.18
lith
0.17
trav
0.17
utan
0.16
Candy
0.16
Activations Density 0.012%