INDEX
Explanations
verbs related to competition and victory
the concept of winning or success
New Auto-Interp
Negative Logits
resa
-0.64
arin
-0.63
alore
-0.59
rehens
-0.58
lapse
-0.57
opian
-0.56
anwhile
-0.55
rimination
-0.55
Factor
-0.54
yrinth
-0.54
POSITIVE LOGITS
now
1.10
nings
0.98
hearts
0.88
ests
0.88
prizes
0.86
trophies
0.83
ced
0.80
battles
0.79
throp
0.77
cest
0.77
Activations Density 0.061%