INDEX
Explanations
The neuron is responding to numeric literals (integer or decimal) appearing in the text.
New Auto-Interp
Negative Logits
DIR
-0.07
MYSQL
-0.07
ingen
-0.07
-0.07
_ACTIV
-0.06
قض
-0.06
และม
-0.06
,↵↵
-0.06
,↵
-0.06
... ↵
-0.06
POSITIVE LOGITS
asy
0.07
Bers
0.07
Personal
0.07
lear
0.07
Rad
0.06
_CAP
0.06
inter
0.06
TableView
0.06
Actor
0.06
reciprocal
0.06
Activations Density 0.005%