INDEX
    Explanations

    personal pronouns indicating possession or relationship

    first-person pronouns and their associated verbs

    New Auto-Interp
    Negative Logits
    amaz
    -0.76
     Barron
    -0.72
    ourgeois
    -0.70
    hatt
    -0.69
    geries
    -0.66
     Revolution
    -0.65
    yss
    -0.65
     Ukrain
    -0.63
    ategory
    -0.62
    inctions
    -0.61
    POSITIVE LOGITS
     deems
    0.97
     deemed
    0.94
     cherish
    0.89
     sorely
    0.86
     deem
    0.86
     hadn
    0.81
     dearly
    0.81
     cannot
    0.80
     couldn
    0.78
     knew
    0.78
    Act Density 0.169%

    No Known Activations