INDEX
Explanations
terms related to demographics and protected characteristics, such as race, ethnicity, nationality, religion, disability, sexual orientation, gender identity, and discrimination
Negative Logits
Cheap       -0.72
pload       -0.67
Downing     -0.66
Turing      -0.65
FedEx       -0.65
playbook    -0.64
Canaver     -0.63
Lever       -0.63
Camel       -0.63
Reviewer    -0.62
Positive Logits
minorities      1.19
disabilities    1.06
ethnicity       1.03
LGBTQ           1.00
ethnic          1.00
LGBT            0.95
gender          0.95
ethnic          0.94
sexual          0.91
minority        0.90
Activations Density: 0.190%