INDEX
    Explanations

    verbs or nouns that convey a strong sense of challenging established norms or beliefs

    terms that indicate defiance or resistance against norms or systems

    New Auto-Interp
    Negative Logits
     praying
    -0.72
     dressing
    -0.69
     lug
    -0.69
     donating
    -0.68
     booking
    -0.66
     scra
    -0.65
     withdrawing
    -0.64
     parting
    -0.64
     sweating
    -0.64
     renting
    -0.63
    POSITIVE LOGITS
    uates
    1.17
    ifies
    1.12
    olves
    1.08
    iates
    1.04
    icates
    1.04
    etheless
    1.03
    tains
    1.01
    cludes
    0.98
    ounded
    0.97
    ould
    0.94
    Act Density 0.165%

    No Known Activations