INDEX
Explanations
The neuron never activates on any tokens—it doesn’t detect or respond to any specific text patterns.
New Auto-Interp
Negative Logits
ctal
-0.07
albums
-0.07
今年
-0.07
Samp
-0.07
.Dep
-0.06
Shel
-0.06
comed
-0.06
dup
-0.06
польз
-0.06
.bel
-0.06
POSITIVE LOGITS
designated
0.06
ouri
0.06
non
0.06
block
0.06
revolution
0.06
.onreadystatechange
0.06
αρά
0.06
/lists
0.06
HttpRequest
0.06
** ↵
0.06
Activations Density 0.027%