INDEX
Explanations
This neuron does not detect any specific content—it remains inactive (zero) across all inputs.
New Auto-Interp
Negative Logits
_boxes
-0.06
Gerald
-0.06
Guild
-0.06
Kauf
-0.06
icked
-0.06
流
-0.06
จำ
-0.06
,test
-0.06
юсь
-0.06
nama
-0.06
POSITIVE LOGITS
NSF
0.07
Behind
0.06
ріш
0.06
behind
0.06
zatímco
0.06
\"]
0.06
discouraged
0.06
состоянии
0.06
underwater
0.06
phủ
0.06
Activations Density 0.028%