INDEX
Explanations
instances of the word "hitter" with activations indicating different levels of relevance
references to baseball players and their performance, particularly hitters
New Auto-Interp
Negative Logits
iment
-0.99
¥µ
-0.88
rex
-0.81
chin
-0.79
Imperium
-0.72
ulla
-0.71
iments
-0.70
ional
-0.70
qual
-0.69
flag
-0.69
POSITIVE LOGITS
batted
1.12
batters
1.07
hitters
1.07
batting
1.05
hitter
1.04
clubhouse
0.94
outfielder
0.94
pitchers
0.89
pitcher
0.87
BIP
0.87
Activations Density 0.027%