INDEX
    Explanations

    references to systemic oppression and marginalization of diverse groups

    New Auto-Interp
    Negative Logits
     صوتيه
    -0.68
    لينكات
    -0.49
     فريبيس
    -0.49
    melada
    -0.45
    XtraBars
    -0.44
    исправ
    -0.44
     RSSSF
    -0.44
     الحره
    -0.44
    apunov
    -0.42
    Prepar
    -0.41
    POSITIVE LOGITS
     racism
    1.59
     discrimination
    1.54
     racist
    1.48
     discriminatory
    1.38
     prejudice
    1.34
     Racism
    1.24
     Discrimination
    1.23
    discrimination
    1.20
     prejudices
    1.20
     racial
    1.20
    Act Density 0.889%

    No Known Activations