INDEX
Explanations
phrases related to acts of physical violence or assault
phrases describing violent actions or assaults
New Auto-Interp
Negative Logits
igne
-0.73
fol
-0.72
lio
-0.70
ablishment
-0.67
âĨij
-0.66
center
-0.66
lore
-0.66
improvement
-0.65
Iss
-0.64
Improvement
-0.63
POSITIVE LOGITS
gunfire
1.12
scissors
1.12
bullets
1.09
knives
1.08
fists
1.04
hamm
1.00
projectiles
1.00
grenades
0.99
gunshots
0.99
axe
0.95
Activations Density 0.132%