INDEX
    Explanations

    different expressions of emotional states or reactions

    New Auto-Interp
    Negative Logits
    OGND
    -0.83
    NameInMap
    -0.73
    MIDDLEWARE
    -0.71
    хьтан
    -0.68
     Italijani
    -0.68
     referenties
    -0.65
    AndFlush
    -0.61
     sund
    -0.60
    \]
    -0.59
     "")
    
    -0.58
    POSITIVE LOGITS
    ={({
    0.63
    oudoune
    0.61
    DebuggerNonUser
    0.59
     tray
    0.57
    0.56
    Tray
    0.56
    :(
    0.55
     مُعرِّف
    0.54
    posisi
    0.53
    esModule
    0.53
    Act Density 0.013%

    No Known Activations