Explanations
terms related to competition and winning
Negative Logits
eron          -0.19
lew           -0.18
eb            -0.17
ersen         -0.16
wine          -0.15
illon         -0.15
cki           -0.15
ASON          -0.15
iagnostics    -0.15
igli          -0.14
Positive Logits
nable         0.27
-win          0.20
streak        0.19
emaker        0.18
ograd         0.18
-loss         0.18
/win          0.18
NER           0.18
одерж         0.17
oren          0.16
Activations Density 0.039%