INDEX
Explanations
references to antagonists or villains in a narrative context
terms related to villains in narratives
Negative Logits
Token         Logit
galitarian    -0.77
sterdam       -0.73
changes       -0.73
ollen         -0.73
glas          -0.72
oday          -0.71
chedel        -0.70
ª             -0.70
ikk           -0.69
arel          -0.68
Positive Logits
Token         Logit
ous           1.21
villain       1.03
Bane          1.01
ously         1.00
villains      0.95
mastermind    0.94
lair          0.83
antagonist    0.79
OUS           0.76
Dracula       0.76
Activations Density: 0.073%