INDEX
    Explanations

    negative sentiments or critical statements

    NEGATIVE LOGITS
     latter
    -0.27
    Архівовано
    -0.19
    页面存档备份
    -0.18
    ПК
    -0.16
    eniable
    -0.16
    phans
    -0.16
     جع
    -0.15
    longleftrightarrow
    -0.14
    ة
    -0.14
     CreateMap
    -0.14
    POSITIVE LOGITS
    odore
    0.24
    adays
    0.18
    _ctxt
    0.16
    ilig
    0.15
    etheless
    0.15
    ris
    0.14
    же
    0.14
    atre
    0.14
    этому
    0.14
    xiety
    0.14
    Activation Density 0.200%

    No Known Activations