Explanations
mentions of terrorist activities or threats
references to hijacking events
Negative Logits
Token       Logit
Rite        -0.89
bleacher    -0.71
Thick       -0.67
Solitaire   -0.67
Veter       -0.66
20439       -0.66
Angels      -0.64
ç¥ŀ         -0.64
rose        -0.63
Bears       -0.63
Positive Logits
Token       Logit
hij         1.18
hijacked    0.99
ackers      0.83
intosh      0.82
aline       0.80
ulence      0.78
acking      0.74
inx         0.73
eering      0.72
anism       0.70
Activation Density: 0.019%