INDEX
Explanations
This neuron detects the special header‐start token (“<|start_header_id|>”) that marks metadata sections in the text.
New Auto-Interp
Negative Logits
触
-0.06
ise
-0.06
�
-0.06
nhân
-0.06
wp
-0.06
_TYP
-0.06
Snap
-0.06
Terminator
-0.06
producer
-0.06
ationToken
-0.06
POSITIVE LOGITS
.game
0.07
PO
0.06
gehen
0.06
(todo
0.06
pís
0.06
playoff
0.06
rápido
0.06
DirectoryInfo
0.06
soaring
0.06
-educated
0.06
Activations Density 0.307%