INDEX
    Explanations

    words related to feelings of helplessness and hopelessness

    expressions of helplessness and hopelessness

    New Auto-Interp
    Negative Logits
    ULT
    -0.74
    APH
    -0.71
    Downloadha
    -0.69
    ioxide
    -0.69
    代
    -0.66
    illon
    -0.66
    ucc
    -0.66
    76561
    -0.64
    itamin
    -0.64
    OTOS
    -0.64
    POSITIVE LOGITS
    ness
    2.75
    nesses
    2.27
    ly
    1.62
    NESS
    1.60
    ity
    1.25
    liness
    1.17
    itude
    1.01
    cies
    1.00
    edly
    1.00
    LY
    1.00
    Act Density 0.131%

    No Known Activations