INDEX
    Explanations

    words related to negative emotions, such as disappointment, disgust, and disaster

    words related to disappointment or negative sentiments

    New Auto-Interp
    Negative Logits
     Kinnikuman
    -0.76
    akeru
    -0.74
    glers
    -0.73
    hetti
    -0.70
     Juliet
    -0.65
     Lans
    -0.63
    Reviewer
    -0.61
     Hod
    -0.61
     Herm
    -0.61
     Goth
    -0.61
    POSITIVE LOGITS
    ruption
    1.07
    cipl
    1.04
    rup
    1.03
    placed
    1.01
    comfort
    0.99
    abled
    0.98
    puted
    0.97
    ciples
    0.96
    licted
    0.95
    rupt
    0.95
    Act Density 0.007%

    No Known Activations