INDEX
Explanations
rankings and scores in sports contexts
New Auto-Interp
Negative Logits
Garner
-0.17
oren
-0.17
atar
-0.16
afil
-0.15
cken
-0.14
erson
-0.14
crt
-0.14
Dien
-0.14
ammen
-0.14
ero
-0.14
POSITIVE LOGITS
éĨ
0.15
Snyder
0.15
JK
0.14
lime
0.14
dn
0.14
ÑģÑĤаÑĤÑĥÑģ
0.14
-ranked
0.14
>\<^
0.14
kü
0.13
-sama
0.13
Activations Density 0.018%