INDEX
Explanations
references to sports teams and their performance
New Auto-Interp
Negative Logits
ÑģÑĤÑĢе
-0.15
leston
-0.14
RL
-0.14
strings
-0.14
peaker
-0.14
eru
-0.14
tember
-0.14
onis
-0.14
arian
-0.13
ÑĤеÑĩ
-0.13
POSITIVE LOGITS
Bott
0.15
erti
0.15
oro
0.14
aura
0.14
Bhar
0.14
/Runtime
0.14
Todd
0.13
nr
0.13
airy
0.13
ghost
0.13
Activations Density 0.029%