Explanations
The neuron activates consistently on tokens that are floating-point numbers (i.e., numeric strings containing a decimal point).
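To make the description concrete, here is a minimal sketch of a check for "numeric string containing a decimal point". The regular expression and the helper name `looks_like_float` are assumptions introduced for illustration; they are not part of the dashboard, and the neuron's real behavior depends on how the tokenizer splits numbers.

```python
import re

# Assumed pattern (not from the dashboard): optional sign, then digits
# with a decimal point somewhere, e.g. "3.14", "-0.07", "1.", ".5".
FLOAT_TOKEN = re.compile(r"[+-]?(?:\d+\.\d*|\.\d+)")


def looks_like_float(token: str) -> bool:
    """Return True if the token, ignoring surrounding whitespace,
    is a numeric string that contains a decimal point."""
    return FLOAT_TOKEN.fullmatch(token.strip()) is not None


if __name__ == "__main__":
    # Tokens taken from this page plus a few edge cases.
    for tok in ["3.14", " 0.06", "-0.07", "35", "reader", "1.", "."]:
        print(repr(tok), looks_like_float(tok))
```

Under this definition, integer tokens such as `35` and word tokens such as `reader` do not count, while signed decimals such as `-0.07` do.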
Negative Logits
35
-0.07
reader
-0.07
-back
-0.06
ears
-0.06
loop
-0.06
vaccine
-0.06
Dungeons
-0.06
Bands
-0.06
(o
-0.06
Floyd
-0.06
Positive Logits
{?}
0.06
DAC
0.06
.&
0.06
forgetting
0.06
&(
0.06
"',↵
0.06
‘
0.06
modelName
0.06
_BAND
0.06
چگونه
0.06
Activations Density 0.321%