INDEX
Explanations
words or phrases related to performance and competition
Negative Logits
ourage    -0.16
åĨĴ    -0.14
imity    -0.14
алов    -0.14
ë¡Ģ    -0.14
åĨ    -0.14
è·Į    -0.14
hof    -0.14
åį±    -0.13
esper    -0.13
Positive Logits
domination    0.32
dominance    0.31
cruise    0.30
rout    0.29
cru    0.28
cruising    0.28
dominating    0.28
dominant    0.28
dominate    0.27
easy    0.26
Activations Density 0.191%
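
A density figure like the one above is usually read as the fraction of sampled token positions on which the feature fires. The sketch below shows that reading under stated assumptions: the function name `activation_density`, the zero threshold, and the toy activation list are all hypothetical illustrations, not the dashboard's actual computation or data.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "active" means strictly greater than `threshold` (0.0 here);
    the dashboard's own cutoff is not stated in the source.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 2 of which activate the feature.
toy_activations = [0.0] * 998 + [1.7, 0.4]
density = activation_density(toy_activations)
print(f"Activations Density {density:.3%}")
```

Under this reading, 0.191% would mean the feature is active on roughly 2 of every 1,000 sampled tokens.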