INDEX
Explanations
phrases related to competitive performance and achievements
New Auto-Interp
Negative Logits
icl
-0.16
Champ
-0.15
iem
-0.15
šak
-0.14
uche
-0.14
uter
-0.14
ic
-0.14
inos
-0.13
pag
-0.13
cdr
-0.13
POSITIVE LOGITS
loe
0.16
ãĥ¼ãĥĦ
0.16
ãĥ³ãĥĨ
0.16
úsqueda
0.15
errat
0.15
STA
0.15
dzi
0.14
çĭĤ
0.14
é«
0.14
samples
0.14
Activations Density 0.011%