Explanations
The neuron activates on numeric literals in source code, i.e. tokens representing numbers.
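To make "numeric literals" concrete, here is a minimal sketch that pulls them out of a Python snippet with the standard-library lexer. It only illustrates the kind of text the explanation refers to. Python's lexer is not the model's tokenizer, so a single literal here may be split into several model tokens. The helper name `numeric_literals` and the sample string are hypothetical and do not come from the dashboard.

```python
import io
import tokenize


def numeric_literals(source: str) -> list[str]:
    """Return the numeric literal tokens found in a Python source string."""
    # generate_tokens expects a readline-style callable that yields str lines.
    reader = io.StringIO(source).readline
    return [
        tok.string
        for tok in tokenize.generate_tokens(reader)
        if tok.type == tokenize.NUMBER  # ints, floats, hex/octal/binary, complex
    ]


# Hypothetical sample: three numeric literals among names and operators.
sample = "x = 42\ny = 3.14 * x + 0x1F\n"
print(numeric_literals(sample))
```

For this sample the sketch prints `['42', '3.14', '0x1F']`. Digits inside string literals or identifiers are not reported, because the lexer classifies those tokens as strings or names.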
Negative Logits
cursor      -0.07
lahoma      -0.06
ляється     -0.06
_Left       -0.06
дом         -0.06
облі        -0.06
_append     -0.06
Annunci     -0.06
,ev         -0.06
Left        -0.06
Positive Logits
pot            0.06
aped           0.06
-paced         0.06
↵ ↵ ↵ ↵        0.06
guns           0.06
(torch         0.06
Deploy         0.06
backs          0.06
значительно    0.06
强             0.06
Activations Density: 0.006%