INDEX
Explanations
the presence of specific personal pronouns and proper nouns in the text
New Auto-Interp
Negative Logits
uppe
-0.16
enne
-0.15
ForMember
-0.14
andal
-0.14
uplic
-0.14
Drive
-0.14
211
-0.13
Aj
-0.13
beck
-0.13
ITICAL
-0.13
POSITIVE LOGITS
#
0.17
igham
0.17
åħĥ
0.15
vido
0.14
ervo
0.13
bsite
0.13
ynom
0.13
tica
0.13
Cog
0.13
amarin
0.13
Activations Density 0.048%