INDEX
Explanations
This neuron is effectively inactive and does not detect any patterns.
New Auto-Interp
Negative Logits
kro
-0.06
initiate
-0.06
alg
-0.06
successors
-0.06
Become
-0.06
Above
-0.06
जव
-0.06
vala
-0.06
sep
-0.06
.uc
-0.06
POSITIVE LOGITS
”。↵↵
0.08
kbd
0.07
기간
0.07
SignIn
0.07
нят
0.06
Raymond
0.06
/pi
0.06
دان
0.06
RYPT
0.06
)]↵↵
0.06
Activations Density 0.027%