INDEX
Explanations
words associated with support and involvement in terrorism or terrorist organizations
New Auto-Interp
Negative Logits
pletion
-0.80
ortality
-0.76
gae
-0.71
pler
-0.70
imentary
-0.69
pex
-0.66
clamation
-0.66
elight
-0.65
aqu
-0.65
pect
-0.64
POSITIVE LOGITS
terrorists
1.08
rebels
1.07
dictators
1.05
militias
1.00
insurgents
0.99
terrorist
0.98
separatist
0.98
murderous
0.98
separatists
0.96
jihadists
0.95
Activations Density 0.292%