INDEX
    Explanations

    The neuron detects occurrences of the word “regardless” (as in “regardless of …”).

    New Auto-Interp
    Negative Logits
    よね
    -0.06
    bud
    -0.06
    arming
    -0.06
    Modes
    -0.06
     mph
    -0.06
    "));
    ↵
    -0.06
    valuator
    -0.06
     Jackson
    -0.06
     sina
    -0.05
    -0.05
    POSITIVE LOGITS
    โล
    0.07
     فض
    0.07
    مز
    0.06
    وره
    0.06
    uida
    0.06
    être
    0.06
     hätte
    0.06
     شک
    0.06
    idend
    0.06
    .tt
    0.06
    Act Density 0.007%

    No Known Activations