INDEX
Explanations
references to specific baseball games and player performances
Negative Logits
asz         -0.16
urses       -0.16
tob         -0.15
BX          -0.15
avra        -0.15
defensive   -0.15
oot         -0.15
ssize       -0.15
plays       -0.15
_SZ         -0.14
Positive Logits
pitching    0.31
Pitch       0.29
pitch       0.29
Pitch       0.28
pitch       0.27
pitches     0.26
ace         0.24
hurl        0.24
pitched     0.23
pitchers    0.23
Activations Density 0.108%