INDEX
Explanations
Code and chat snippets
This neuron selectively activates on floating‐point numeric tokens (decimal numbers).
New Auto-Interp
Negative Logits
.translate
-0.07
AV
-0.07
throws
-0.07
itable
-0.06
telephone
-0.06
ря
-0.06
کنم
-0.06
.Prop
-0.06
ặ
-0.06
()(
-0.06
POSITIVE LOGITS
间
0.07
ความร
0.06
�
0.06
Augustine
0.06
679
0.06
Şah
0.06
Ann
0.06
\models
0.06
νη
0.06
━━━━━━━━
0.06
Activations Density 0.026%