INDEX
    Explanations

    phrases that express personal opinions or evaluations

    New Auto-Interp
    Negative Logits
    amburger
    -0.16
    .utf
    -0.15
     наÑģÑĤ
    -0.15
    ahat
    -0.14
    urm
    -0.14
    alse
    -0.14
    ÄĻ
    -0.14
    alleng
    -0.13
    atan
    -0.13
    èĺ
    -0.13
    POSITIVE LOGITS
    fortunate
    0.15
    osis
    0.15
     attribute
    0.14
    ŀæĢ§
    0.14
    _cli
    0.14
    -uri
    0.14
     Complaint
    0.14
    agger
    0.13
     deliberate
    0.13
     intentional
    0.13
    Act Density 0.142%

    No Known Activations