INDEX
    Explanations

    words related to moral judgment and value

    the concept of being 'worthy' or deserving of something

    New Auto-Interp
    Negative Logits
    cation
    -0.64
     Esk
    -0.64
     Rou
    -0.64
    esville
    -0.63
    ingo
    -0.62
    acid
    -0.62
    eteria
    -0.62
    zyme
    -0.62
    iq
    -0.61
    rief
    -0.60
    POSITIVE LOGITS
     worthy
    0.86
     deserving
    0.84
    lihood
    0.82
    minded
    0.81
     successors
    0.81
     successor
    0.79
     contenders
    0.76
    nesses
    0.76
     heirs
    0.75
     consideration
    0.73
    Act Density 0.013%

    No Known Activations