INDEX
    Explanations

    references to women and gender equality

    New Auto-Interp
    Negative Logits
     Woman
    -0.23
     Womens
    -0.21
    ;width
    -0.21
     woman
    -0.21
     Women
    -0.20
    Woman
    -0.20
    Boy
    -0.19
     ÙĪÙĦÙĬ
    -0.19
     wrists
    -0.18
    女人
    -0.18
    POSITIVE LOGITS
     men
    0.24
     unw
    0.21
    men
    0.21
     monthly
    0.20
     girls
    0.19
    -men
    0.18
     Men
    0.17
    Month
    0.17
     Monthly
    0.17
    Monthly
    0.17
    Act Density 0.068%

    No Known Activations