INDEX
    Explanations

    expressions of strong emotion or dramatic statements

    New Auto-Interp
    Negative Logits
    Beschreibung
    -0.31
    省市镇
    -0.28
     상세
    -0.28
     akt
    -0.26
     commun
    -0.25
     Grit
    -0.25
    是大
    -0.25
    зонта
    -0.24
     chapa
    -0.24
    -0.23
    POSITIVE LOGITS
     joke
    0.68
     المعيارى
    0.68
     sarcas
    0.68
     jokingly
    0.67
     joking
    0.67
    Humor
    0.67
     sarcasm
    0.66
    AndEndTag
    0.65
     chuckle
    0.65
     laughter
    0.64
    Act Density 0.065%

    No Known Activations