INDEX
Explanations
instances of strong competition or opposition
terms related to competition and participation in various contexts
Negative Logits
atron       -0.90
mas         -0.88
bia         -0.84
notice      -0.83
chemy       -0.82
ascript     -0.82
iness       -0.80
imet        -0.79
ternity     -0.79
nery        -0.77
Positive Logits
alcoholic    0.84
agents       0.84
spouses      0.82
entities     0.80
heights      0.80
partners     0.76
adults       0.76
factions     0.75
competitors  0.75
viewpoints   0.75
Activations Density: 0.185%