INDEX
Explanations
The neuron responds to numeric tokens (including integers and decimals) in the text.
New Auto-Interp
Negative Logits
.Axis
-0.07
πολι
-0.06
Erect
-0.06
потер
-0.06
Brushes
-0.06
.mob
-0.06
csrf
-0.06
GitHub
-0.06
oftware
-0.06
.prop
-0.06
POSITIVE LOGITS
vably
0.06
hobbies
0.06
lications
0.06
Taste
0.06
MANUAL
0.06
sights
0.06
believing
0.06
mA
0.06
BIND
0.05
vable
0.05
Activations Density 0.020%