INDEX
    Explanations

    emotional reactions and expressions of surprise or discomfort

    New Auto-Interp
    Negative Logits
     defaultstate
    -0.71
     debout
    -0.69
     للمعارف
    -0.69
     auroit
    -0.67
     répondu
    -0.64
    berdayakan
    -0.61
     feroit
    -0.61
    Архівовано
    -0.56
     vastaan
    -0.56
    Personensuche
    -0.55
    POSITIVE LOGITS
    parsedMessage
    0.74
    MessageTagHelper
    0.58
     '\\;'
    0.56
     autorytatywna
    0.51
    олові
    0.50
    Implement
    0.50
     disambiguazione
    0.50
     oprot
    0.48
    sizeCache
    0.48
    followed
    0.47
    Act Density 0.110%

    No Known Activations