INDEX
Explanations
references to specific terrorist organizations, particularly al-Qaeda and its affiliates
Negative Logits
Token     Logit
.sponge   -0.17
azo       -0.15
alborg    -0.14
assi      -0.14
.sdk      -0.14
Href      -0.13
Homer     -0.13
Knox      -0.13
ODY       -0.13
ARTH      -0.13
Positive Logits
Token     Logit
Qaeda     0.25
-Qaeda    0.25
al        0.23
aeda      0.20
queda     0.20
Nurs      0.19
Fate      0.19
ÐĹав      0.19
_CI       0.16
Bay       0.16
Activations Density: 0.014%