INDEX
Explanations
phrases related to terrorist activities or individuals
references to terrorists
Negative Logits
shire     -0.70
irth      -0.69
ership    -0.67
ered      -0.67
times     -0.67
pel       -0.65
NRS       -0.65
bred      -0.65
Wheel     -0.65
laus      -0.64
Positive Logits
terrorists    1.07
bombers       0.97
apons         0.91
mastermind    0.90
terrorist     0.88
abad          0.88
detonated     0.85
gunmen        0.85
terrorist     0.84
attackers     0.83
Activations Density: 0.010%