INDEX
Explanations
personal pronouns and verbs related to statements
the pronoun "he" in various contexts
New Auto-Interp
Negative Logits
iries
-0.65
display
-0.64
Bearing
-0.63
iframe
-0.63
Seym
-0.62
history
-0.62
Pastebin
-0.59
ا
-0.58
equate
-0.57
etheless
-0.57
POSITIVE LOGITS
Majesty
1.02
zbollah
0.99
resy
0.87
eded
0.86
mos
0.78
ALTH
0.77
'd
0.75
bert
0.75
ather
0.71
majesty
0.70
Activations Density 0.422%