INDEX
    Explanations

    This neuron responds to phrasing that indicates someone “has been tasked with” or assigned a responsibility.

    New Auto-Interp
    Negative Logits
     mentoring
    -0.07
    incl
    -0.06
    ,因
    -0.06
     seaborn
    -0.06
     LEFT
    -0.06
     DAY
    -0.06
    ENO
    -0.06
     pz
    -0.06
     Pamela
    -0.06
    _WALL
    -0.06
    POSITIVE LOGITS
    0.07
    г
    0.07
    0.06
    trag
    0.06
    rası
    0.06
    requ
    0.06
    kám
    0.06
    ậm
    0.06
    ční
    0.06
    üğ
    0.06
    Act Density 0.031%

    No Known Activations