INDEX
Explanations
The neuron never fires on any tokens—it doesn’t detect any pattern.
New Auto-Interp
Negative Logits
submits
-0.07
ARRAY
-0.06
Hub
-0.06
เขต
-0.06
>↵↵
-0.06
よね
-0.06
Hol
-0.06
'];?>↵
-0.06
allele
-0.06
ervlet
-0.06
POSITIVE LOGITS
STRICT
0.07
future
0.06
realidad
0.06
เข
0.06
دم
0.06
0.06
атегор
0.06
mechanism
0.06
INTERN
0.06
ческое
0.06
Activations Density 0.001%