INDEX
Explanations
mentions of the term "jihadi."
references to jihad or related terms associated with jihadist ideology
New Auto-Interp
Negative Logits
cms
-0.73
cast
-0.68
CAST
-0.68
ular
-0.68
rating
-0.67
tions
-0.67
DEN
-0.67
decree
-0.66
Ĥİ
-0.65
Decay
-0.65
POSITIVE LOGITS
ihadi
1.03
knife
0.85
crim
0.69
oppy
0.68
ionage
0.67
atten
0.67
ilitary
0.67
apy
0.67
ishing
0.66
xual
0.65
Activations Density 0.027%