INDEX
Explanations
The neuron activates on numeric literal tokens—especially decimal and fractional constants in mathematical expressions.
New Auto-Interp
Negative Logits
Hồ
-0.07
"@
-0.07
martyr
-0.07
Agility
-0.07
July
-0.07
combo
-0.07
Mold
-0.07
Audience
-0.07
성
-0.07
durumu
-0.07
POSITIVE LOGITS
various
0.07
1
0.06
ostat
0.06
داو
0.06
کردن
0.06
과의
0.06
完
0.06
프
0.06
Verify
0.06
các
0.06
Activations Density 0.008%