INDEX
Explanations
phrases and concepts related to "winning" in various contexts
New Auto-Interp
Negative Logits
tes
-0.20
vore
-0.16
/or
-0.16
als
-0.15
uteur
-0.15
EB
-0.15
zent
-0.14
duct
-0.14
kov
-0.14
ìĤ¬íķŃ
-0.13
POSITIVE LOGITS
nable
0.20
-win
0.16
now
0.16
throp
0.14
amaño
0.14
riminator
0.14
/win
0.14
ingly
0.14
NF
0.13
agli
0.13
Activations Density 0.067%