INDEX
    Explanations

    terms related to demographics and protected characteristics, such as race, ethnicity, nationality, religion, disability, sexual orientation, gender identity, and discrimination

    New Auto-Interp
    Negative Logits
     Cheap
    -0.72
    pload
    -0.67
     Downing
    -0.66
     Turing
    -0.65
     FedEx
    -0.65
     playbook
    -0.64
     Canaver
    -0.63
     Lever
    -0.63
     Camel
    -0.63
    Reviewer
    -0.62
    POSITIVE LOGITS
     minorities
    1.19
     disabilities
    1.06
     ethnicity
    1.03
     LGBTQ
    1.00
     ethnic
    1.00
    LGBT
    0.95
    gender
    0.95
    ethnic
    0.94
    sexual
    0.91
     minority
    0.90
    Act Density 0.190%

    No Known Activations