INDEX
    Explanations

    topics related to gender equality and women's representation

    New Auto-Interp
    Negative Logits
     taste
    -0.29
     حب
    -0.26
    couvrez
    -0.25
    TextWatcher
    -0.25
     Blut
    -0.24
     NEXT
    -0.24
     păr
    -0.24
    -0.24
     Einf
    -0.24
     indah
    -0.24
    POSITIVE LOGITS
    Women
    1.00
     Women
    0.99
     women
    0.96
     féminine
    0.93
     WOMEN
    0.93
     Gender
    0.90
    women
    0.89
     gender
    0.87
    Gender
    0.87
     feminine
    0.87
    Act Density 0.553%

    No Known Activations