INDEX
Explanations
words related to conflict or confrontation with an opponent
mentions of "enemy" in various contexts
New Auto-Interp
Negative Logits
uesday
-0.79
hetti
-0.78
oled
-0.78
eret
-0.77
ajo
-0.75
atism
-0.75
aughs
-0.74
orrow
-0.72
aza
-0.71
utic
-0.71
POSITIVE LOGITS
combatants
1.09
invasion
0.84
enemy
0.83
takeover
0.82
fighters
0.80
Enemy
0.77
commander
0.76
ambush
0.75
horde
0.75
soldier
0.74
Activations Density 0.028%