INDEX
Explanations
references to LGBTQ+ identity and representation
New Auto-Interp
Negative Logits
anken
-0.14
chantment
-0.13
mlink
-0.13
odial
-0.13
nutrient
-0.12
ullan
-0.12
ossa
-0.12
岡
-0.12
aurant
-0.12
uang
-0.12
POSITIVE LOGITS
LGBT
0.68
LGBTQ
0.66
gay
0.66
queer
0.60
lesbian
0.60
Lesbian
0.59
homosexual
0.58
Gay
0.57
gays
0.56
gay
0.56
Activations Density 0.527%