INDEX
Explanations
This neuron detects structural or formatting tokens (e.g., special markup and header identifiers) that delineate the instruction-response context.
New Auto-Interp
Negative Logits
查看
-0.07
札
-0.06
╝
-0.06
查询
-0.06
ocaust
-0.06
củ
-0.06
startPosition
-0.06
َال
-0.06
得到
-0.06
020
-0.06
POSITIVE LOGITS
Awards
0.07
duties
0.07
-js
0.06
нести
0.06
tüket
0.06
-l
0.06
contents
0.06
LJ
0.06
MN
0.06
цять
0.06
Activations Density 0.003%