INDEX
    Explanations

    excessive negativity and expressions of dissatisfaction

    New Auto-Interp
    Negative Logits
     Madness
    -0.16
    oria
    -0.15
    _firestore
    -0.15
    ISMATCH
    -0.14
     Speedway
    -0.14
    ized
    -0.14
    hips
    -0.14
    endon
    -0.14
     madness
    -0.14
    ausal
    -0.14
    POSITIVE LOGITS
    ger
    0.20
    dest
    0.17
    GER
    0.17
     luck
    0.16
     Samar
    0.16
    stell
    0.16
    acha
    0.15
    chester
    0.15
    ening
    0.15
    -case
    0.15
    Act Density 0.078%

    No Known Activations