INDEX
    Explanations

    computer code

    This neuron activates on words related to task‐planning and procedural steps (e.g. “tasks,” “dependencies,” “resolve,” “needed,” etc.).

    New Auto-Interp
    Negative Logits
     Knot
    -0.07
     reply
    -0.07
    Returned
    -0.07
     fruit
    -0.06
    returned
    -0.06
     eggs
    -0.06
     putting
    -0.06
    View
    -0.06
     odpově
    -0.06
    -0.06
    POSITIVE LOGITS
     Hend
    0.07
     bour
    0.06
    UMMY
    0.06
     inlet
    0.06
     مذه
    0.06
    (:,
    0.06
    omics
    0.06
     постанов
    0.06
    관련
    0.06
     ylabel
    0.06
    Act Density 0.008%

    No Known Activations