INDEX
Explanations
This neuron activates on numeric tokens, especially decimal numbers and other numerical expressions.
New Auto-Interp
Negative Logits
button
-0.07
IRE
-0.06
function
-0.06
�
-0.06
retiring
-0.06
%
-0.06
McD
-0.06
Pass
-0.06
ainda
-0.06
�
-0.06
POSITIVE LOGITS
.publisher
0.07
wert
0.06
mağ
0.06
Shane
0.06
ProgressBar
0.06
-cover
0.06
방
0.06
empl
0.05
små
0.05
lux
0.05
Activations Density 0.081%