INDEX
Explanations
phrases related to violence and aggression
references to war or violence
NEGATIVE LOGITS
ãĤ´ãĥ³    -0.89
atown     -0.71
ruary     -0.70
ĸļ        -0.69
cribed    -0.69
Volunte   -0.68
ISTORY    -0.67
olkien    -0.66
anship    -0.65
zed       -0.65
POSITIVE LOGITS
crap          1.16
entire        1.12
messenger     1.09
offending     1.06
throats       1.01
intruder      1.01
opponent      1.00
unsuspecting  0.97
innocent      0.94
enemy         0.93
Activations Density 0.266%