INDEX
Explanations
equals sign
This neuron is effectively dead—it never responds to any tokens and thus doesn’t detect any pattern.
New Auto-Interp
Negative Logits
.activities
-0.07
independ
-0.07
Residents
-0.06
)})↵
-0.06
DECL
-0.06
())↵
-0.06
eyJ
-0.06
774
-0.06
discussion
-0.06
)L
-0.06
POSITIVE LOGITS
圈
0.07
opyright
0.07
_theme
0.06
şekl
0.06
蛋
0.06
аза
0.06
Od
0.06
Zem
0.06
roomId
0.06
đế
0.06
Activations Density 0.024%