INDEX
Explanations
references to systemic oppression and marginalization of diverse groups
New Auto-Interp
Negative Logits
صوتيه
-0.68
لينكات
-0.49
فريبيس
-0.49
melada
-0.45
XtraBars
-0.44
исправ
-0.44
RSSSF
-0.44
الحره
-0.44
apunov
-0.42
Prepar
-0.41
POSITIVE LOGITS
racism
1.59
discrimination
1.54
racist
1.48
discriminatory
1.38
prejudice
1.34
Racism
1.24
Discrimination
1.23
discrimination
1.20
prejudices
1.20
racial
1.20
Activations Density 0.889%