INDEX
Explanations
references to racial and ethnic identities, particularly those related to the Black and Caribbean populations
New Auto-Interp
Negative Logits
kona
-0.47
Hoon
-0.45
rzost
-0.42
TextAppearance
-0.42
converti
-0.42
balances
-0.41
Kohn
-0.41
orial
-0.41
أل
-0.40
vig
-0.40
POSITIVE LOGITS
Jews
0.62
Latinos
0.60
Jewish
0.59
Jewish
0.58
Hispanics
0.57
Latino
0.55
Muslims
0.55
Jews
0.54
Latino
0.53
Arabs
0.52
Activations Density 0.472%