Explanations
sorting numbers
This neuron activates on numeric tokens (digits, signs, decimal points, and fractions), i.e. it detects numbers.
Negative Logits
Token      Logit
esture     -0.06
remembers  -0.06
pas        -0.06
IMP        -0.06
шлях       -0.06
rend       -0.06
66         -0.06
px         -0.06
ippi       -0.06
_Dep       -0.06
Positive Logits
Token      Logit
iets       0.07
."));↵     0.06
siyaset    0.06
tics       0.06
Лі         0.06
$res       0.06
se         0.06
Flow       0.06
_nt        0.06
'};↵       0.06
Activations Density: 0.007%
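
This page does not define how the density figure is computed. One common reading, assumed here, is the fraction of sampled tokens on which the neuron's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from this neuron.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumed definition of "activation density"; the dashboard's exact
    formula is not stated in the source.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count tokens on which the neuron fires above the threshold.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Illustrative data only: 7 active tokens out of 100,000 sampled tokens.
sample = [0.0] * 99_993 + [1.5] * 7
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.007%
```

Under this reading, a density of 0.007% would mean the neuron fires on roughly 7 of every 100,000 sampled tokens, which is typical of a sparse, selective unit.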