INDEX
Explanations
code snippets
This neuron fires on phrases where the author says they “found” or “came across” code or solutions online or in another thread.
New Auto-Interp
Negative Logits
입니다
-0.06
öld
-0.06
tsky
-0.06
geometric
-0.06
alloc
-0.06
ead
-0.06
Pandora
-0.06
_OLD
-0.06
case
-0.06
Finish
-0.06
POSITIVE LOGITS
(cv
0.08
/community
0.07
regul
0.07
Ellie
0.06
BaseService
0.06
.cam
0.06
ceptions
0.06
>p
0.06
Memorial
0.06
Constitution
0.06
Activations Density 0.068%