Explanations
mentions of hitting something, whether it's a physical object or a metaphorical target
instances of the word "hit."
Negative Logits
pires   -0.73
inent   -0.65
agin    -0.65
otype   -0.64
æĢ      -0.63
orld    -0.63
ç«      -0.59
UTH     -0.57
aer     -0.56
udes    -0.56
Positive Logits
ched        1.19
boxes       0.96
ches        0.92
puberty     0.89
hardest     0.79
box         0.77
tle         0.76
milestones  0.76
chens       0.76
peak        0.75
Activations Density 0.033%
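
An activation density figure like the one above is commonly read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading. It assumes density means the fraction of strictly positive activations; the dashboard's exact definition and sample size are not stated here. The function name `activation_density` and the sample values are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: "density" = (tokens where the feature fires) / (all tokens).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, 3 of which activate the feature.
sample = [0.0] * 9997 + [0.4, 1.2, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.033% would correspond to roughly one activating token in every 3,000 sampled.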