INDEX
    Explanations

    phrases related to societal norms, beliefs, professions, citizenship, and roles

    terms related to social roles and identities

    New Auto-Interp
    Negative Logits
    forestation
    -0.64
     unbeliev
    -0.61
     insanity
    -0.60
    ãĥį
    -0.60
    plane
    -0.59
    osphere
    -0.59
    osi
    -0.58
     Sense
    -0.57
     vacancy
    -0.56
    Availability
    -0.56
    POSITIVE LOGITS
    hips
    1.20
    mith
    1.12
     themselves
    1.12
    hip
    1.06
    pring
    0.95
    paces
    0.91
     unto
    0.91
    cale
    0.89
    ometimes
    0.88
    cript
    0.87
    Act Density 0.254%

    No Known Activations