INDEX
    Explanations

    words related to compassion, empathy, and leniency

    concepts related to mercy and compassion

    New Auto-Interp
    Negative Logits
    yi
    -0.78
    orn
    -0.77
    ORN
    -0.76
    andals
    -0.74
    kj
    -0.72
    add
    -0.66
    need
    -0.66
    ouf
    -0.66
    ossier
    -0.65
    gars
    -0.64
    POSITIVE LOGITS
     mercy
    1.14
     pard
    0.90
     auctions
    0.80
     forgiveness
    0.76
    saf
    0.73
    efully
    0.72
     forgive
    0.70
     pardon
    0.69
     relief
    0.68
     giveaway
    0.68
    Act Density 0.011%

    No Known Activations