INDEX
    Explanations

    school, class

    This neuron detects when the text mentions an academic or school-assignment context (e.g. “school,” “project,” “course”).

    New Auto-Interp
    Negative Logits
    HQ
    -0.07
     Blasio
    -0.06
    Number
    -0.06
     cg
    -0.06
     Charlie
    -0.06
    ско
    -0.06
    zoek
    -0.06
    Once
    -0.06
    _JOB
    -0.06
    _TWO
    -0.06
    POSITIVE LOGITS
     comrades
    0.07
    Τ
    0.07
     аж
    0.07
    izza
    0.06
     çeşit
    0.06
     이야
    0.06
    。不
    0.06
    0.06
     pedestrian
    0.06
    ",__
    0.06
    Act Density 0.055%

    No Known Activations