INDEX
Explanations
instances of physical violence or injury
New Auto-Interp
Negative Logits
ghai
-0.71
igne
-0.69
shire
-0.66
âĨij
-0.65
ablishment
-0.65
ilight
-0.65
amental
-0.63
idity
-0.63
former
-0.61
iance
-0.60
POSITIVE LOGITS
impunity
1.08
gunfire
0.96
gust
0.95
fists
0.90
regards
0.87
bullets
0.86
stood
0.85
arrows
0.83
scissors
0.82
missiles
0.81
Activations Density 0.047%