INDEX
Explanations
This neuron activates on programming code (e.g. Python code blocks and their tokens) rather than on ordinary prose.
Negative Logits
anticipated
-0.06
runs
-0.06
desired
-0.06
tren
-0.06
凌
-0.06
指
-0.06
яс
-0.06
visited
-0.06
eleri
-0.06
êu
-0.06
Positive Logits
ầm
0.07
แห
0.07
{},
0.07
села
0.06
بج
0.06
329
0.06
spr
0.06
(targetEntity
0.06
achinery
0.06
(PDO
0.06
Activations Density 0.031%