Explanations
The neuron consistently activates on numeric tokens (digits, decimals, p-values, etc.), effectively detecting numbers within the text.
Negative Logits
_yaw           -0.07
姐             -0.07
-other         -0.07
ViewPager      -0.06
amet           -0.06
_chg           -0.06
-leg           -0.06
powdered       -0.06
shirt          -0.06
renting        -0.06

Positive Logits
jourd          0.07
CTIONS         0.07
Africa         0.07
.S             0.06
privateKey     0.06
"os            0.06
criteria       0.06
्ययन           0.06
.department    0.06
=((            0.06
Activations Density: 0.007%
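The page does not say how the density figure is computed. As a rough sketch, assuming it is the share of tokens on which the neuron's activation is above zero, it could be calculated as below. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of tokens on which the
    neuron fires at all; the source page does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 7 active tokens out of 100,000.
sample = [0.0] * 99_993 + [1.5] * 7
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.007% would mean the neuron fires on roughly 7 of every 100,000 tokens, which fits a detector for a narrow token class such as numbers.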