Explanations
The neuron detects onomatopoetic or figurative words for striking or hitting (e.g. slap, smack, slapped).
Negative Logits
okolí     -0.07
sớm       -0.06
أيض       -0.06
laws      -0.06
keeps     -0.06
begins    -0.06
ambient   -0.06
redi      -0.06
UU        -0.06
began     -0.06
Positive Logits
ام        0.09
slap      0.09
slam      0.09
slash     0.09
smack     0.08
banged    0.08
bang      0.08
slapped   0.08
slashed   0.07
mashed    0.07
Activations Density 0.019%