INDEX
    Explanations

    conversational expressions indicating personal opinions and experiences

    Internet slang and emoticons

    informal laughter or emoticons

    New Auto-Interp
    Negative Logits
     محفوظة
    -0.70
    ьаж
    -0.68
    ,‎
    -0.66
    InstrumentedTest
    -0.64
    . 
    -0.63
    ".
    
    -0.63
    .•
    -0.63
    \",\"
    -0.63
    SharedDtor
    -0.62
     Мексичка
    -0.62
    POSITIVE LOGITS
     lol
    1.97
     haha
    1.91
     LOL
    1.78
     ;)
    1.76
     hehe
    1.74
     ;-)
    1.73
     :)
    1.70
     Haha
    1.70
     :-)
    1.69
     hahaha
    1.65
    Act Density 0.550%

    No Known Activations