INDEX
    Explanations

    expressions of dissatisfaction or negative sentiments about situations

    New Auto-Interp
    Negative Logits
    ÃŃn
    -0.15
    ereum
    -0.14
    rett
    -0.14
    Sharper
    -0.14
    bitrary
    -0.14
    inality
    -0.14
    uliar
    -0.13
    ansa
    -0.13
     Elias
    -0.13
    _visibility
    -0.13
    POSITIVE LOGITS
     Optim
    0.19
     optimism
    0.18
    optim
    0.18
     optimistic
    0.18
     optim
    0.18
    happy
    0.17
    AGO
    0.17
    ä¹IJ
    0.17
     happy
    0.16
     hopeful
    0.16
    Act Density 0.002%

    No Known Activations