INDEX
Explanations
references to significant events related to LGBTQ+ pride and social justice
New Auto-Interp
Negative Logits
767
-0.16
gne
-0.15
ori
-0.15
жд
-0.15
ponde
-0.14
teÅŁ
-0.14
Formatter
-0.14
hya
-0.14
abet
-0.13
adera
-0.13
POSITIVE LOGITS
partnership
0.15
dr
0.15
dong
0.14
ahr
0.14
long
0.14
dol
0.14
oficial
0.14
اخ
0.14
è»
0.14
licity
0.13
Activations Density 0.855%