INDEX
Explanations
references to home runs in baseball
New Auto-Interp
Negative Logits
ify
-0.17
ideshow
-0.17
intree
-0.16
ments
-0.16
holm
-0.15
ÑĢазд
-0.15
mts
-0.15
ìŀ¡
-0.15
Pitch
-0.14
landa
-0.14
POSITIVE LOGITS
plate
0.20
plate
0.20
coming
0.17
uns
0.17
Run
0.16
run
0.16
Runs
0.16
Plate
0.16
opus
0.15
runs
0.15
Activations Density 0.005%