INDEX
Explanations
references to discrimination and mistreatment based on race or gender
Negative Logits
cref          -0.58
gebene        -0.55
abil          -0.54
uolo          -0.53
ctible        -0.51
setDefault    -0.51
issements     -0.50
__["          -0.50
rawDesc       -0.49
veur          -0.49
Positive Logits
recevoir            0.71
receive             0.69
Receiving           0.68
receives            0.66
receiving           0.65
received            0.65
Receive             0.63
received            0.63
Receives            0.62
MigrationBuilder    0.62
Activations Density 0.692%