INDEX
Explanations
mentions of armed actions or conflicts
instances of the word "armed."
New Auto-Interp
Negative Logits
rx
-0.97
rb
-0.77
HUD
-0.76
arget
-0.75
Remastered
-0.72
article
-0.70
Woodward
-0.69
coli
-0.68
apest
-0.67
MQ
-0.67
POSITIVE LOGITS
robbery
1.07
armed
0.92
guards
0.91
insurrection
0.91
ament
0.90
robberies
0.86
robber
0.84
robbers
0.84
coup
0.80
uprising
0.79
Activations Density 0.018%