INDEX
Explanations
terms related to the LGBT community
references to the LGBTQ community and related topics
references to the LGBTQ+ community and issues related to discrimination
Negative Logits
mington   -0.76
ither     -0.74
fulness   -0.71
lio       -0.71
lessly    -0.68
crore     -0.66
ded       -0.65
ding      -0.63
ibble     -0.63
Saving    -0.62
Positive Logits
ugal      0.95
sect      0.79
ynski     0.79
ère       0.74
TY        0.74
alyst     0.74
ãĥ³ãĤ¸    0.73
Q         0.72
TI        0.70
Strauss   0.70
Activations Density 0.028%