INDEX
Explanations
code/instructions
The neuron detects floating‐point numbers (decimal values) in the text.
New Auto-Interp
Negative Logits
Rod
-0.08
etections
-0.07
(in
-0.07
(level
-0.07
_PR
-0.07
モ
-0.07
Stride
-0.06
족
-0.06
_IN
-0.06
POST
-0.06
POSITIVE LOGITS
alnız
0.07
».↵↵
0.07
?'↵↵
0.07
capt
0.06
'#
0.06
0.06
>I
0.06
тяжел
0.06
">
0.06
?>"
0.06
Activations Density 0.002%