INDEX
    Explanations

    references to gender, particularly male-related terms and roles

    New Auto-Interp
    Negative Logits
    setVerticalGroup
    -0.76
    AutoScaleMode
    -0.44
    Sklici
    -0.40
     ISNI
    -0.40
    ExecuteReader
    -0.39
    Groetjes
    -0.37
    viewBox
    -0.36
     CascadeType
    -0.36
     externi
    -0.34
    تفصیلات
    -0.34
    POSITIVE LOGITS
     manly
    0.78
     himself
    0.77
     masculino
    0.75
     męski
    0.73
     masculinity
    0.71
     mascul
    0.71
     masculinos
    0.71
     مردانه
    0.68
     masculina
    0.68
    Mascul
    0.68
    Act Density 1.157%

    No Known Activations