INDEX
Explanations
concepts related to racial identity and social justice
New Auto-Interp
Negative Logits
YRO
-0.17
fak
-0.14
AFX
-0.14
šov
-0.14
еÑĩ
-0.13
ilde
-0.13
ìłĪ
-0.13
Exporter
-0.13
esin
-0.13
Encryption
-0.13
POSITIVE LOGITS
Diversity
0.32
diversity
0.32
equity
0.31
Equity
0.30
Bias
0.29
unconscious
0.27
ally
0.27
race
0.27
EDI
0.26
racial
0.26
Activations Density 0.128%