INDEX
Explanations
code snippets
The neuron activates on numeric literal tokens (e.g. integers or floating‐point numbers).
New Auto-Interp
Negative Logits
ninth
-0.07
again
-0.07
mars
-0.07
дея
-0.06
material
-0.06
itals
-0.06
(Session
-0.06
Granted
-0.06
goal
-0.06
mates
-0.06
POSITIVE LOGITS
volte
0.07
μο
0.07
Charlie
0.06
cleansing
0.06
_CHARS
0.06
_palette
0.06
Illustr
0.05
colonial
0.05
�
0.05
�
0.05
Activations Density 0.038%