INDEX
    Explanations

    references to race and ethnicity

    New Auto-Interp
    Negative Logits
    rief
    -0.71
    intenance
    -0.69
    ISTRATION
    -0.69
    المناصب
    -0.68
    chaften
    -0.68
    +#+
    -0.66
    KELEY
    -0.66
    tanleria
    -0.66
     Paglinawan
    -0.66
    NOST
    -0.65
    POSITIVE LOGITS
     HttpHeaders
    0.44
    penup
    0.42
     ingenieros
    0.42
    abestanden
    0.41
     střed
    0.40
    DR
    0.40
     convenio
    0.38
     inš
    0.38
     kvinna
    0.37
    Member
    0.36
    Act Density 0.157%

    No Known Activations