INDEX
Explanations
Code/metadata
The neuron primarily detects the special end‐of‐text marker token (“<|eot_id|>”).
New Auto-Interp
Negative Logits
setStatus
-0.07
(icon
-0.07
dělen
-0.07
yaz
-0.06
igth
-0.06
bal
-0.06
آز
-0.06
yang
-0.06
brightness
-0.06
(bin
-0.06
POSITIVE LOGITS
κε
0.07
believes
0.07
epis
0.07
farther
0.07
/mysql
0.06
inea
0.06
subscribe
0.06
ِب
0.06
.au
0.06
believe
0.06
Activations Density 0.051%