INDEX
Explanations
the main characters in a story or narrative
references to protagonists in narratives
New Auto-Interp
Negative Logits
ements
-0.80
assic
-0.78
sterdam
-0.76
dos
-0.73
tx
-0.72
igree
-0.72
namese
-0.70
eln
-0.70
aret
-0.69
redit
-0.69
POSITIVE LOGITS
protagonist
1.01
heroine
0.93
acters
0.88
protagonists
0.87
Ethan
0.76
Eleven
0.70
avatar
0.65
agonist
0.65
character
0.63
hei
0.63
Activations Density 0.021%