INDEX
Explanations
proper nouns or names of individuals
pronouns related to individuals and entities in the text
New Auto-Interp
Negative Logits
));
-0.67
izo
-0.63
)))
-0.63
Zoro
-0.62
CVE
-0.61
oké
-0.60
lasting
-0.60
())
-0.59
))
-0.58
Weston
-0.57
POSITIVE LOGITS
foundland
0.86
%"
0.79
nomine
0.78
pherd
0.77
heid
0.75
ablishment
0.75
anmar
0.74
chwitz
0.73
estine
0.72
anamo
0.72
Activations Density 0.272%