INDEX
Explanations
references to battles and warfare
New Auto-Interp
Negative Logits
ascript
-0.80
oba
-0.70
essage
-0.63
livest
-0.62
oci
-0.62
amera
-0.62
ħĭ
-0.61
çīĪ
-0.61
ure
-0.60
gow
-0.60
POSITIVE LOGITS
roy
1.12
cru
1.10
axe
1.03
front
1.03
waged
1.01
fought
1.00
against
0.94
raged
0.93
raging
0.88
naire
0.86
Activations Density 0.018%