INDEX
    Explanations

    The neuron fires on polite email closings expressing thanks (e.g. “Thank you for your time and …”).

    New Auto-Interp
    Negative Logits
     trợ
    -0.07
    -0.07
    petto
    -0.06
    prung
    -0.06
    ่ง
    -0.06
     توص
    -0.06
    -0.06
    .:.:.
    -0.06
     Guys
    -0.06
    яться
    -0.06
    POSITIVE LOGITS
    'We
    0.07
    UDP
    0.07
     voi
    0.07
     capabilities
    0.07
    …)
    0.06
     toprak
    0.06
    aji
    0.06
     attendance
    0.06
     Oregon
    0.06
    raj
    0.06
    Act Density 0.005%

    No Known Activations