INDEX
Explanations
This neuron activates on floating-point numeric tokens (decimal numbers with a “.”).
New Auto-Interp
Negative Logits
.keywords
-0.07
neste
-0.06
ter
-0.06
�
-0.06
ters
-0.06
oker
-0.06
중요한
-0.06
Peace
-0.06
curs
-0.06
θη
-0.06
POSITIVE LOGITS
.rstrip
0.06
agency
0.06
แห
0.06
Began
0.06
-nine
0.06
inmates
0.06
taille
0.06
GRAPH
0.06
Scottish
0.06
’ı
0.06
Activations Density 0.011%