Explanations
This neuron remains silent (zero activation) on all input tokens, i.e. it does not detect any meaningful pattern.
Negative Logits
nhiệt
-0.07
socialist
-0.07
upid
-0.06
सबस
-0.06
-capital
-0.06
率
-0.06
Βα
-0.06
entarios
-0.06
Çocuk
-0.06
универ
-0.06
Positive Logits
�
0.06
bail
0.06
':{'
0.06
.pc
0.06
retrieved
0.06
henne
0.06
postupně
0.06
nul
0.06
gleich
0.06
後に
0.06
Activations Density 0.010%