INDEX
Explanations
words related to adversaries or opponents
the term "opponent" used in various contexts related to conflict or competition
New Auto-Interp
Negative Logits
olia
-0.83
Loch
-0.71
erd
-0.70
eret
-0.66
oots
-0.66
mberg
-0.66
umph
-0.65
ocratic
-0.65
itizens
-0.62
acca
-0.62
POSITIVE LOGITS
batters
0.84
foe
0.80
onent
0.80
opponent
0.79
opponents
0.70
combatants
0.69
hesis
0.68
vanquished
0.67
onents
0.67
oft
0.67
Activations Density 0.032%