Explanations
mentions of explosive devices, particularly cluster bombs
references to various types of bombs
Negative Logits
Token      Logit
SEA        -0.83
Relations  -0.82
Stud       -0.77
TH         -0.75
Days       -0.71
WR         -0.71
Orth       -0.70
States     -0.70
Malt       -0.69
WD         -0.69
Positive Logits
Token      Logit
bombs      1.17
poons      1.13
detonated  1.07
deton      1.02
bombings   0.98
bomb       0.95
bomber     0.93
bombing    0.93
poon       0.90
bombed     0.89
Activations Density 0.005%