INDEX
    Explanations

    words related to shame or shameful actions

    New Auto-Interp
    Negative Logits
     Ajax
    -0.85
     Tone
    -0.73
     Leth
    -0.67
    OLOGY
    -0.66
     Reach
    -0.66
     Luther
    -0.64
     unfocusedRange
    -0.63
    anwhile
    -0.62
     ground
    -0.61
    ãĥ¼ãĥĨãĤ£
    -0.60
    POSITIVE LOGITS
    apesh
    1.29
    atters
    1.29
    rapnel
    1.24
    apeshifter
    1.22
    oddy
    1.20
    aded
    1.19
    aders
    1.17
    attering
    1.15
    ackle
    1.13
    ippers
    1.13
    Act Density 0.018%

    No Known Activations