Explanations
mentions of the term "jihadi" and related words or mentions of locations associated with the term
references to extremist groups and ideologies
Negative Logits
disinfect      -0.70
correcting     -0.64
present        -0.62
bl             -0.62
duck           -0.62
woods          -0.62
bind           -0.61
Appalach       -0.61
rods           -0.59
appra          -0.59
Positive Logits
ihad           5.12
ihadi          1.94
azeera         1.17
imet           1.05
soever         1.04
ourney         0.99
iannopoulos    0.97
iott           0.96
abet           0.95
igi            0.94
Activations Density 0.010%
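As a rough illustration of the density figure, the sketch below assumes that activation density means the fraction of token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a feature counts as "active" on a token when its activation
    is strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 10,000.
sample = [0.0] * 9999 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this assumption, a density of 0.010% corresponds to the feature firing on roughly one token in every 10,000.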