Explanations
the pronoun "he" in various contexts
Negative Logits
agnet    -0.16
tab      -0.16
kest     -0.16
.realm   -0.16
dep      -0.15
li       -0.15
AGIC     -0.15
s        -0.15
.cm      -0.14
ampion   -0.14
Positive Logits
itaire        0.17
-urlencoded   0.16
stalk         0.16
Å©            0.15
yne           0.14
mxArray       0.14
iner          0.14
ysize         0.14
GPC           0.14
avier         0.14
Activations Density 0.023%