INDEX
    Explanations

    references to gender, specifically male and female

    Categories of people differentiated by sex or gender

    Negative Logits
     Plin           -0.87
     للاسماء        -0.83
     Athenians      -0.82
    FormTagHelper   -0.82
     }}"></         -0.79
    Попис           -0.77
     Guayaquil      -0.77
     Normdatei      -0.74
    EDEFAULT        -0.74
    Vidite          -0.72

    Positive Logits
    volent           0.61
     Men             0.49
    SizeF            0.47
    BooleanField     0.46
     n               0.45
     keras           0.42
    ogyn             0.42
     singular        0.41
     honor           0.41
    webdriver        0.41
    Activation Density: 0.161%

    No Known Activations