INDEX
    Explanations

    This neuron fires on German instructional words and phrases that ask for detailed descriptions (e.g. “beschreib … detailliert” or “ausführlich Schreibstil”).

    New Auto-Interp
    Negative Logits
    高速
    -0.06
    SAMPLE
    -0.06
    ël
    -0.06
    /cpu
    -0.06
     swept
    -0.06
     ArgumentError
    -0.06
    -0.06
     Sour
    -0.06
     filtered
    -0.06
    -part
    -0.06
    POSITIVE LOGITS
    收益
    0.07
     prose
    0.07
     kaç
    0.07
    동안
    0.07
    SETTING
    0.07
     elucid
    0.07
     cười
    0.06
     ot
    0.06
    ?"↵↵
    0.06
     typography
    0.06
    Act Density 0.025%

    No Known Activations