INDEX
    Explanations

    phrases related to different professions and roles

    identifiers related to personal roles or identities

    New Auto-Interp
    Negative Logits
     \'
    -0.74
    ombies
    -0.61
    videos
    -0.61
    rists
    -0.60
    Stars
    -0.60
    >]
    -0.59
    odo
    -0.57
    COMPLE
    -0.57
    åħī
    -0.57
    andals
    -0.57
    POSITIVE LOGITS
     myself
    1.29
     yourself
    0.91
     oneself
    0.89
     ourselves
    0.89
     accustomed
    0.84
     educator
    0.82
     grows
    0.78
     I
    0.77
     we
    0.77
     yourselves
    0.75
    Act Density 0.114%

    No Known Activations