Explanations
The neuron activates selectively on numeric tokens in the text, such as years, counts, and measurements.
Negative Logits
-speaking       -0.07
Charm           -0.06
Stephanie       -0.06
_ptrs           -0.06
güney           -0.06
巴              -0.06
depart          -0.06
para            -0.06
preventative    -0.06
Ba              -0.06
Positive Logits
at              0.07
` ↵             0.07
ufact           0.06
ialias          0.06
<=$             0.06
Fres            0.06
']));           0.06
'];             0.06
,readonly       0.06
./              0.06
Activations Density 0.061%