INDEX
    Explanations

    terms related to fairness and equitable treatment

    New Auto-Interp
    Negative Logits
    -0.62
    MemoryWarning
    -0.60
     חיצוניים
    -0.58
    VIAF
    -0.56
    PositiveButton
    -0.54
     ")");
    -0.53
    บค
    -0.51
    ectoria
    -0.51
     egentlig
    -0.50
     الإنترنت
    -0.48
    POSITIVE LOGITS
     fair
    2.89
    fair
    2.54
    Fair
    2.41
     Fair
    2.21
     FAIR
    2.16
    FAIR
    2.07
     fairness
    1.97
     fairer
    1.90
     unfair
    1.82
     Fairness
    1.74
    Act Density 0.172%

    No Known Activations