INDEX
Explanations
mentions of characters or people
characters identified as villains
New Auto-Interp
Negative Logits
adow
-0.79
iannopoulos
-0.76
Beir
-0.74
achu
-0.71
atches
-0.70
Huck
-0.69
asma
-0.68
riet
-0.68
ewski
-0.68
atisf
-0.67
POSITIVE LOGITS
Board
0.70
Rating
0.67
rated
0.64
valued
0.63
Tre
0.61
cipline
0.60
Sword
0.60
cially
0.60
);
0.60
enza
0.60
Activations Density 0.000%