INDEX
Explanations
mentions of physical attacks or confrontations
references to individuals involved in violent actions
New Auto-Interp
Negative Logits
zl
-0.88
Balt
-0.80
umph
-0.78
Revival
-0.75
rebirth
-0.66
Rebirth
-0.65
Chart
-0.65
binding
-0.64
sit
-0.64
urgical
-0.63
POSITIVE LOGITS
attackers
0.95
attacker
0.89
assailants
0.87
gunmen
0.86
assailant
0.80
attack
0.80
beware
0.79
intent
0.79
wielding
0.79
prow
0.78
Activations Density 0.076%