INDEX
    Explanations

    mentions of women or gender-related topics

    references to women and gender issues

    New Auto-Interp
    Negative Logits
    REDACTED
    -0.86
    opher
    -0.76
    asper
    -0.76
    anoia
    -0.75
    ebus
    -0.75
    rador
    -0.75
    eme
    -0.74
    ype
    -0.72
    RAY
    -0.72
    UFF
    -0.71
    POSITIVE LOGITS
    folk
    1.21
     empowerment
    1.07
     genital
    1.00
     reproductive
    0.92
    opausal
    0.88
     breasts
    0.87
     menstru
    0.87
     genitals
    0.86
     entrepreneurs
    0.81
     nipples
    0.80
    Act Density 0.086%

    No Known Activations