INDEX
Explanations
people associated with the LGBTQ community
references to marginalized social groups, particularly LGBTQ+ individuals and their experiences
New Auto-Interp
Negative Logits
Inspection
-0.76
quickShipAvailable
-0.75
Accessory
-0.73
HL
-0.72
Bloom
-0.71
Completed
-0.69
IFF
-0.69
Nut
-0.69
PDATE
-0.67
Hig
-0.66
POSITIVE LOGITS
paces
0.87
hood
0.87
everywhere
0.84
who
0.80
folk
0.76
lesbian
0.76
lesbians
0.74
disproportionately
0.73
living
0.72
patriarchy
0.71
Activations Density 0.128%