INDEX
Explanations
keywords related to characters or personas
references to characters in narratives or games
Negative Logits
ainer       -0.79
tesy        -0.72
ral         -0.71
washer      -0.70
vice        -0.66
jury        -0.65
loader      -0.65
usted       -0.64
ledge       -0.63
Exercise    -0.63
Positive Logits
characters      3.73
Characters      3.03
Characters      2.57
character       2.42
protagonists    2.09
acters          2.07
Character       1.84
Character       1.79
charact         1.78
character       1.77
Activations Density: 0.025%