Explanations
references to Black individuals and communities
Negative Logits
akin     -0.17
717      -0.14
gers     -0.14
171      -0.14
Sexo     -0.14
lama     -0.14
cul      -0.14
èά       -0.14
pping    -0.14
mt       -0.14
Positive Logits
-owned   0.25
Lives    0.24
male     0.20
owned    0.19
lives    0.19
males    0.18
men      0.18
faces    0.18
face     0.17
Owned    0.17
Activations Density 0.036%