INDEX
Explanations
references to the Taliban
mentions of the Taliban
New Auto-Interp
Negative Logits
abet
-0.71
yss
-0.71
secut
-0.71
isner
-0.69
efer
-0.68
gas
-0.68
++++++++++++++++
-0.67
leon
-0.66
OHN
-0.66
Quadro
-0.65
POSITIVE LOGITS
insurgents
1.17
istani
1.14
insurgency
1.04
Taliban
1.00
istan
0.97
insurgent
0.90
Afghan
0.82
Haram
0.81
militants
0.80
infiltr
0.78
Activations Density 0.007%