INDEX
Explanations
mentions of al-Qaeda and its affiliates
Negative Logits
arts      -0.16
artz      -0.15
приб      -0.15
ngen      -0.15
.sponge   -0.14
eff       -0.14
erox      -0.14
ovit      -0.13
自治      -0.13
MBER      -0.13
Positive Logits
-Qaeda    0.25
Qaeda     0.21
queda     0.19
-Q        0.18
aeda      0.18
hur       0.18
kidd      0.17
subtype   0.16
azeera    0.16
asu       0.16
Activations Density: 0.007%