INDEX
    Explanations

    conversational interjections or greetings

    New Auto-Interp
    Negative Logits
    aná
    -0.07
     風
    -0.07
    æĮ¯ãĤĬ
    -0.07
    yw
    -0.07
    =yes
    -0.07
    หว
    -0.07
    armacy
    -0.07
    .scalablytyped
    -0.06
    à¸Ħำ
    -0.06
    implode
    -0.06
    POSITIVE LOGITS
     even
    0.07
     prest
    0.07
     maybe
    0.07
     worked
    0.07
    206
    0.06
     just
    0.06
    atten
    0.06
    iena
    0.06
     stranger
    0.06
    170
    0.06
    Act Density 0.007%

    No Known Activations