INDEX
Explanations
words related to the LGBT community and LGBTQ+ rights
New Auto-Interp
Negative Logits
guiActiveUnfocused
-0.73
anke
-0.72
frames
-0.66
tenance
-0.66
manship
-0.65
lain
-0.64
oxide
-0.62
osaurs
-0.62
"$:/
-0.61
urers
-0.60
POSITIVE LOGITS
IQ
1.05
amily
0.80
rights
0.75
equality
0.74
activists
0.73
hovah
0.70
Spectrum
0.70
atri
0.70
LGBT
0.70
yre
0.69
Activations Density 5.333%