INDEX
Explanations
words related to military or violent actions, specifically the word "bombing"
references to bombing events or campaigns
NEGATIVE LOGITS
laus    -0.95
eva     -0.76
Lear    -0.75
learn   -0.75
mia     -0.74
BOOK    -0.74
ITY     -0.74
mis     -0.69
Tang    -0.69
dit     -0.68
POSITIVE LOGITS
bombing    1.22
bombings   1.06
raids      1.03
bomber     0.97
spree      0.97
barr       0.94
bombers    0.91
bombed     0.86
raid       0.83
bombard    0.82
Activations Density 0.017%
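
The density figure above is usually read as the share of token positions on which the feature fires at all. The sketch below shows one way such a percentage could be computed. The function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from this page or from the tool that produced it.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of token positions whose activation exceeds threshold.

    Assumption: a feature counts as "firing" when its activation is strictly
    greater than `threshold` (0.0 by default).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return 100.0 * fired / len(activations)


# Hypothetical data: 10,000 token positions, with the feature firing on 2 of them.
sample = [0.0] * 9998 + [3.1, 0.7]
print(f"{activation_density(sample):.3f}%")  # prints 0.020%
```

Under this reading, a density of 0.017% would correspond to the feature firing on roughly 17 of every 100,000 tokens.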