INDEX
Explanations
mentions of armed conflict or battles
instances of the word "fighting" and related terms
New Auto-Interp
Negative Logits
ħĭ
-0.80
æĺ¯
-0.74
uras
-0.72
ographical
-0.70
ophile
-0.69
ocl
-0.69
OGR
-0.68
Reviewer
-0.67
gow
-0.66
Starship
-0.66
POSITIVE LOGITS
fighting
1.21
raged
1.00
fights
0.97
waged
0.97
fighters
0.96
fatig
0.93
against
0.92
brig
0.87
alongside
0.85
forces
0.84
Activations Density 0.052%