INDEX
    Explanations

    This neuron detects structural or formatting tokens (e.g., special markup and header identifiers) that delineate the instruction-response context.

    New Auto-Interp
    Negative Logits
     查看
    -0.07
    -0.06
    -0.06
    查询
    -0.06
    ocaust
    -0.06
     củ
    -0.06
     startPosition
    -0.06
    َال
    -0.06
    得到
    -0.06
    020
    -0.06
    POSITIVE LOGITS
     Awards
    0.07
     duties
    0.07
    -js
    0.06
    нести
    0.06
     tüket
    0.06
    -l
    0.06
     contents
    0.06
     LJ
    0.06
     MN
    0.06
    цять
    0.06
    Act Density 0.003%

    No Known Activations