INDEX
Explanations
words related to military airstrikes
references to air strikes
New Auto-Interp
Negative Logits
ħĭ
-0.70
Lomb
-0.70
Wealth
-0.70
æĥ
-0.69
Bagg
-0.67
Barber
-0.67
åİ
-0.64
ITY
-0.63
Argon
-0.62
ðĿ
-0.62
POSITIVE LOGITS
strikes
0.89
targeting
0.83
strike
0.83
achi
0.81
munitions
0.78
targeted
0.77
aimed
0.75
airstrikes
0.75
force
0.74
force
0.74
Activations Density 0.019%