INDEX
Explanations
phrases related to physical actions, particularly focusing on the verb "punch"
references to the word "punch" and its variations in various contexts
New Auto-Interp
Negative Logits
uve
-0.85
abeth
-0.75
icter
-0.71
abor
-0.70
aird
-0.69
izen
-0.64
uph
-0.63
bec
-0.62
ignty
-0.61
heimer
-0.61
POSITIVE LOGITS
bowl
1.26
holes
0.81
punch
0.80
bag
0.77
punches
0.77
bags
0.77
weed
0.76
apult
0.75
card
0.73
ting
0.73
Activations Density 0.043%