Explanations
terms related to fairness and equitable treatment
Negative Logits
Logit   Token
-0.62
-0.60   MemoryWarning
-0.58   חיצוניים
-0.56   VIAF
-0.54   PositiveButton
-0.53   ")");
-0.51   บค
-0.51   ectoria
-0.50   egentlig
-0.48   الإنترنت
Positive Logits
Logit   Token
2.89    fair
2.54    fair
2.41    Fair
2.21    Fair
2.16    FAIR
2.07    FAIR
1.97    fairness
1.90    fairer
1.82    unfair
1.74    Fairness
Activations Density: 0.172%