INDEX
Explanations
terms related to discrimination and inequality based on various characteristics such as race, color, religion, sex, and other protected classes
Negative Logits
فريبيس            -0.87
estekak           -0.78
DockStyle         -0.66
migrationBuilder  -0.65
Sandstone         -0.64
fohlen            -0.63
Искәрмәләр        -0.63
//});             -0.61
RenderAtEndOf     -0.61
للمعارف           -0.61
Positive Logits
ethnicity    0.72
gender       0.68
nationality  0.66
interests    0.63
ethnic       0.63
status       0.62
interesses   0.60
gender       0.59
Ethnicity    0.59
Nationality  0.59
Activations Density: 0.447%