INDEX
Explanations
The neuron selectively activates on code-style identifier tokens—especially those containing or ending with “id” (e.g. typeid, offset).
New Auto-Interp
Negative Logits
Jos
-0.07
Taş
-0.06
notices
-0.06
Jog
-0.06
&)
-0.06
notably
-0.06
ถนน
-0.06
.Protocol
-0.06
次
-0.06
Hats
-0.06
POSITIVE LOGITS
SO
0.07
_HERE
0.06
0.06
Capt
0.06
lesia
0.06
olkata
0.06
"+↵
0.06
ázd
0.06
.geometry
0.06
natur
0.06
Activations Density 0.206%