Explanations
The neuron activates on programming control-flow keywords (such as “if”, “else”, “switch”, “for”, and “while”) that introduce a control structure.
Negative Logits
(sigma      -0.07
_perf       -0.07
Γ           -0.07
maks        -0.06
frække      -0.06
DMETHOD     -0.06
State       -0.06
production  -0.06
ото         -0.06
htonl       -0.06
Positive Logits
+.          0.06
Appears     0.06
…………        0.06
ху          0.06
_",         0.06
ocup        0.06
。“          0.06
MITTED      0.06
hind        0.06
Hidden      0.06
Activations Density 0.014%