INDEX
    Explanations

    phrases related to men's issues or movements

    references to gender-related topics, specifically focusing on women's issues

    New Auto-Interp
    Negative Logits
    SIGN
    -0.73
    OVER
    -0.66
     snail
    -0.63
    ļéĨĴ
    -0.60
     Leilan
    -0.58
    SET
    -0.57
     commons
    -0.56
     Heist
    -0.55
    EStream
    -0.55
     Presidents
    -0.54
    POSITIVE LOGITS
    atisf
    1.02
    kaya
    0.93
    outhern
    0.91
    lightly
    0.90
    pecially
    0.90
    wered
    0.89
    atellite
    0.89
    bestos
    0.89
    ween
    0.87
    omew
    0.87
    Act Density 0.190%

    No Known Activations