INDEX
Explanations
This neuron activates on numeric tokens, identifying numbers (including integers and decimals) in the text.
New Auto-Interp
Negative Logits
},
-0.07
'},
-0.07
Origin
-0.06
variants
-0.06
_Tag
-0.06
>');
-0.06
.StatusBadRequest
-0.06
unavailable
-0.06
_TYP
-0.06
Ridley
-0.06
POSITIVE LOGITS
يمكن
0.07
BEGIN
0.07
eydi
0.06
tvoří
0.06
něž
0.06
>n
0.06
jeden
0.06
vandal
0.06
↵
0.06
прям
0.06
Activations Density 0.059%