INDEX
Explanations
mentions of characters or people in a text
references to characters in narratives
New Auto-Interp
Negative Logits
VERTIS
-0.72
rup
-0.72
ת
-0.72
yg
-0.70
ntil
-0.67
LOCK
-0.66
Effective
-0.66
condition
-0.64
galitarian
-0.63
Tant
-0.63
POSITIVE LOGITS
acters
1.52
istically
1.17
istics
1.07
arcs
0.86
izations
0.86
characters
0.83
portraits
0.82
assassinate
0.82
isations
0.78
Characters
0.78
Activations Density 0.046%