INDEX
Explanations
keywords related to the September 11 terrorist attacks
references to the dates surrounding the September 11 attacks
New Auto-Interp
Negative Logits
PLIC
-0.73
mbuds
-0.68
cour
-0.65
LLOW
-0.63
itled
-0.63
blush
-0.62
reprene
-0.62
SHIP
-0.61
carrot
-0.61
veland
-0.61
POSITIVE LOGITS
11
1.01
Attacks
0.98
attacks
0.92
2001
0.88
ghazi
0.84
ocaust
0.83
Massacre
0.82
attacks
0.81
bombings
0.80
massacres
0.80
Activations Density 0.052%