INDEX
Explanations
terms related to military actions, such as bombings, attacks, and campaigns
references to military actions and war-related incidents
New Auto-Interp
Negative Logits
antha
-0.76
Lumin
-0.74
ournals
-0.73
Laure
-0.72
Cind
-0.70
Beauty
-0.70
Happiness
-0.70
Certified
-0.67
models
-0.67
ience
-0.66
POSITIVE LOGITS
retali
1.36
indiscrim
1.34
assaults
1.19
retaliation
1.19
bombardment
1.16
attacks
1.13
provocation
1.11
targeting
1.11
escalation
1.10
bombings
1.10
Activations Density 0.898%