INDEX
    Explanations

    expressions of frustration and annoyance

    New Auto-Interp
    Negative Logits
       
    -0.16
     Lew
    -0.15
    lake
    -0.15
    pan
    -0.14
    care
    -0.14
    缼
    -0.14
    ัà¸Ļà¸Ĺ
    -0.14
    /functions
    -0.14
    ales
    -0.14
    art
    -0.14
    POSITIVE LOGITS
    ingly
    0.25
    /conf
    0.19
    warts
    0.18
    ovny
    0.17
    ly
    0.17
    /alert
    0.16
    ÑģÑı
    0.16
    oire
    0.15
    .selenium
    0.15
    ATAB
    0.14
    Act Density 0.048%

    No Known Activations