Explanations
The neuron fires on numeric tokens, especially decimal numbers (e.g. percentages or floating-point values).
Negative Logits
rear             -0.07
/span            -0.06
playwright       -0.06
accreditation    -0.06
accordance       -0.06
shortcomings     -0.06
paddingBottom    -0.06
JM               -0.06
¢                -0.06
ический          -0.06
Positive Logits
záb              0.07
.Are             0.06
انو              0.06
lối              0.06
(cd              0.06
-fin             0.06
_DISK            0.06
су               0.06
+h               0.06
bekommen         0.06
Activations Density 0.082%
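
As a rough illustration of what a density figure like this measures, the sketch below computes the share of token positions at which a neuron's activation exceeds a threshold. The function name `activation_density`, the zero threshold, and the toy data are all assumptions made for illustration; the page does not state how the 0.082% value was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    `activations` is a flat list of per-token activation values for one neuron.
    The zero default threshold is an assumption, not something the page specifies.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 8 of which are active.
toy = [0.0] * 9992 + [1.5] * 8
density = activation_density(toy)
print(f"{density:.3%}")
```

A density well below 1% means the neuron is silent on almost all tokens, which fits a feature that responds only to a narrow class of inputs such as decimal numbers.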