INDEX
Explanations
terms related to LGBTQ+ identities and advocacy
references to LGBTQ+ identities and issues
Negative Logits
snowball     -0.56
viewer       -0.55
Mous         -0.54
Redditor     -0.51
Unch         -0.51
spur         -0.50
witz         -0.49
龍           -0.48
tightened    -0.48
pload        -0.48
Positive Logits
etc          1.28
etc          1.11
…)           0.84
…            0.81
…            0.79
ect          0.77
….           0.76
...)         0.73
&            0.70
welf         0.70
Activations Density 0.179%