Explanations
This neuron activates on numeric tokens, especially floating‐point numbers (decimal values).
Negative Logits
KING        -0.07
cosa        -0.07
들          -0.06
spect       -0.06
endregion   -0.06
ROW         -0.06
вход        -0.06
Porter      -0.06
冠          -0.06
eurs        -0.06
Positive Logits
افزار               0.09
'};↵               0.07
.setStyle          0.07
Vimeo              0.06
>');↵              0.06
بلغ                0.06
getDefault         0.06
-к                 0.06
.SingleOrDefault   0.06
.pageSize          0.06
Activations Density: 0.001%