INDEX
    Explanations

    expressions of disappointment and frustration in interpersonal situations

    New Auto-Interp
    Negative Logits
    oops
    -0.16
    ká
    -0.15
     Dank
    -0.15
     Hmm
    -0.15
    _VO
    -0.15
    drv
    -0.14
     Oops
    -0.14
     åĵ
    -0.14
     Yup
    -0.14
    æĮĤ
    -0.14
    POSITIVE LOGITS
     seriously
    0.37
     come
    0.31
     Come
    0.28
    Seriously
    0.27
     Seriously
    0.27
    come
    0.26
    ser
    0.25
     why
    0.25
     c
    0.24
     serious
    0.24
    Act Density 0.298%

    No Known Activations