INDEX
    Explanations

    words related to negative emotions or events, specifically sadness

    New Auto-Interp
    Negative Logits
    byter
    -0.78
    iltration
    -0.72
    orthy
    -0.72
    Ranked
    -0.71
    ouver
    -0.69
    RAFT
    -0.67
    VERTISEMENT
    -0.67
    ravings
    -0.64
    UID
    -0.64
    IBLE
    -0.63
    POSITIVE LOGITS
    der
    1.25
    istic
    1.08
    istically
    1.06
    omas
    1.05
    Sad
    0.96
     sad
    0.87
    imaru
    0.85
    die
    0.84
    stal
    0.82
    istical
    0.76
    Act Density 0.010%

    No Known Activations