Explanations
terms related to violent actions and attacks on people or property
references to violence or attacks against individuals, particularly police and civilians
Negative Logits
Token        Logit
renaissance  -0.72
Conj         -0.66
Farming      -0.65
joining      -0.64
lopp         -0.62
join         -0.62
reinvent     -0.62
imester      -0.59
ISTORY       -0.58
okin         -0.57
Positive Logits
Token        Logit
targets      0.88
unarmed      0.86
itals        0.79
helpless     0.78
innocent     0.74
senseless    0.73
soever       0.72
unconscious  0.72
innoc        0.70
indiscrim    0.69
Activations Density 0.769%