INDEX
Explanations
mentions of or pertaining to transgender people
references to transgender and LGBTQ+ individuals and their related issues
Negative Logits
aic                  -0.82
Nut                  -0.80
inventoryQuantity    -0.76
quickShipAvailable   -0.75
othy                 -0.73
Recipe               -0.68
Autom                -0.68
HL                   -0.67
Beer                 -0.67
Software             -0.66
Positive Logits
disproportionately   0.96
residing             0.93
oppressed            0.93
living               0.92
marginalized         0.92
who                  0.90
hood                 0.90
queer                0.88
disproportion        0.87
persecuted           0.86
Activations Density 0.239%