Explanations
mentions of specific individuals involved in extremist activities
names of individuals and groups associated with conflict or violence
Negative Logits
Token         Logit
Dragonbound   -0.79
Voy           -0.79
Uncharted     -0.79
Deadpool      -0.78
GDDR          -0.76
Titanic       -0.73
Wilde         -0.73
mammal        -0.73
FTC           -0.72
Victorian     -0.71
Positive Logits
Token         Logit
awi           1.10
qqa           1.06
Islamic       1.06
iyah          1.06
abi           1.03
Islam         1.03
mosque        1.00
Islamist      0.99
Ahmad         0.98
aida          0.97
Activations Density 0.287%
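
A density figure like the one above is usually read as the share of sampled tokens on which the feature fires at all. The sketch below shows one way such a percentage could be computed. It assumes that "active" means an activation strictly above zero, which this page does not state; the function name `activation_density` and the sample data are hypothetical and do not come from this feature.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as active when its activation is strictly
    greater than `threshold` (0.0 by default). The source page does not
    define the cutoff it uses.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 1000 tokens, 3 of which activate the feature.
example = [0.0] * 997 + [0.4, 1.2, 2.5]
density = activation_density(example)
print(f"Activations Density {density:.3f}%")
```

A low percentage of this kind indicates a sparse feature: it stays silent on most tokens and fires only on a narrow slice of text.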