INDEX
Explanations
references to various sports, particularly baseball and basketball
New Auto-Interp
Negative Logits
football
-0.18
baseball
-0.18
basketball
-0.17
rugby
-0.17
Baseball
-0.17
hockey
-0.17
tennis
-0.16
äd
-0.16
Hockey
-0.15
adele
-0.15
POSITIVE LOGITS
/base
0.23
-playing
0.22
-reference
0.19
/Base
0.19
bum
0.16
λε
0.16
åĵ¡
0.16
/music
0.15
-related
0.15
-loving
0.15
Activations Density 0.054%