INDEX
Explanations
climbing trees
This neuron never activates on any meaningful token—it’s effectively “dead” and doesn’t respond to anything.
New Auto-Interp
Negative Logits
SHORT
-0.08
.mas
-0.07
_circle
-0.07
库
-0.06
overseas
-0.06
.alt
-0.06
=================================================
-0.06
_stop
-0.06
_hour
-0.06
omas
-0.06
POSITIVE LOGITS
wsp
0.06
로그램
0.06
secular
0.06
+w
0.06
xEB
0.06
,)↵
0.06
reur
0.06
Can
0.06
riet
0.06
firefox
0.06
Activations Density 0.029%