INDEX
Explanations
The neuron activates on numerical tokens—especially decimal numbers—embedded in technical text.
New Auto-Interp
Negative Logits
수상
-0.07
Formatted
-0.06
zaw
-0.06
Alone
-0.06
Enumerator
-0.06
GOR
-0.06
truyền
-0.06
پرس
-0.06
حذ
-0.06
šla
-0.06
POSITIVE LOGITS
dé
0.07
диаг
0.07
tidak
0.06
мотря
0.06
Nos
0.06
DEAL
0.06
variable
0.06
_MED
0.06
adv
0.06
itos
0.06
Activations Density 0.024%