INDEX
Explanations
This neuron fires on fractional numeric tokens (decimal parts of numbers).
Negative Logits
estimation      -0.07
BackColor       -0.06
DOMAIN          -0.06
वस              -0.06
approximation   -0.06
Population      -0.06
claims          -0.06
Rel             -0.06
AUSE            -0.06
system          -0.06
Positive Logits
("^             0.07
://'            0.07
Southeast       0.07
\"]             0.07
新的            0.07
tweeting        0.07
clair           0.07
елов            0.06
.Here           0.06
'');↵           0.06
Activations Density 0.007%