INDEX
Explanations
This neuron remains inactive (zero‐activation) on all tokens shown, so it does not detect any specific pattern.
New Auto-Interp
Negative Logits
rite
-0.06
Gen
-0.06
(fin
-0.06
.Doc
-0.06
مح
-0.06
adan
-0.06
atroc
-0.06
follic
-0.06
getCurrent
-0.06
ومن
-0.06
POSITIVE LOGITS
indent
0.07
NVIDIA
0.07
Everyone
0.06
?)
0.06
.ForeignKey
0.06
gutter
0.06
touted
0.06
ница
0.06
eds
0.06
?$
0.06
Activations Density 0.001%