INDEX
Explanations
references to LGBTQ+ themes and events, particularly related to Pride Month and associated rights
New Auto-Interp
Negative Logits
elo
-0.15
aden
-0.15
OLT
-0.15
ammable
-0.15
izio
-0.15
Sexe
-0.15
aper
-0.14
olo
-0.14
olon
-0.14
ÙĬÙĦاد
-0.14
POSITIVE LOGITS
pride
0.25
rights
0.23
IQ
0.22
-rights
0.21
-friendly
0.21
0.20
Pride
0.20
IA
0.20
QA
0.19
friendly
0.19
Activations Density 0.028%