INDEX
    Explanations

    terms related to social stigma and taboos

    New Auto-Interp
    Negative Logits
    rowse
    -0.18
    jon
    -0.17
    umble
    -0.16
    izzo
    -0.16
    rak
    -0.16
     inst
    -0.15
    ands
    -0.15
     Ion
    -0.14
    .raise
    -0.14
     Buch
    -0.14
    POSITIVE LOGITS
    åħ¸
    0.14
    ertility
    0.14
    PLOY
    0.14
    eph
    0.14
    á»Ļc
    0.14
    غاز
    0.14
    ãģ£ãģį
    0.14
    phyl
    0.14
     Yoshi
    0.13
     Ñģклада
    0.13
    Act Density 0.014%

    No Known Activations