INDEX
Explanations
references to discrimination and inequality in various contexts
New Auto-Interp
Negative Logits
незавершена
-0.57
فريبيس
-0.43
SuspendLayout
-0.43
AnimationsModule
-0.41
useAppContext
-0.41
WebElementEntity
-0.41
✭✭
-0.41
Stuhl
-0.41
ويكيميديا
-0.40
CodeAttribute
-0.40
POSITIVE LOGITS
rejected
0.64
denied
0.61
refused
0.60
rejection
0.56
Rejected
0.56
admission
0.54
reject
0.54
rejected
0.52
Rejection
0.51
denied
0.51
Activations Density 0.507%