Explanations
terms related to the LGBTQ+ community, specifically focusing on lesbian individuals
references to lesbian identity and experiences
Negative Logits
Token     Logit
rex       -0.80
EY        -0.77
ULE       -0.76
æĸ¹       -0.76
Score     -0.74
KT        -0.74
frames    -0.72
VIS       -0.69
MER       -0.69
unes      -0.68
Positive Logits
Token           Logit
lesbian         1.09
Lesbian         1.01
couples         0.97
bisexual        0.94
lesbians        0.94
heterosexual    0.83
sex             0.81
sexuality       0.81
equality        0.80
homosexual      0.79
Activations Density: 0.007%