INDEX
    Explanations

    mentions of occupations and roles, particularly in professional contexts

    New Auto-Interp
    Negative Logits
     mej
    -0.16
    132
    -0.14
    iant
    -0.14
    uko
    -0.14
    echa
    -0.14
    opak
    -0.14
    iene
    -0.14
    Multiplicity
    -0.13
     Cv
    -0.13
    ycz
    -0.13
    POSITIVE LOGITS
     she
    0.19
     ê·¸ëĬĶ
    0.18
     à¤īसन
    0.17
    νÏī
    0.16
     saya
    0.16
     usted
    0.15
     he
    0.15
     you
    0.15
     она
    0.15
    ogan
    0.15
    Act Density 0.175%

    No Known Activations