INDEX
Explanations
words and phrases related to LGBTQ+ identities, specifically focusing on bisexuality
references to bisexuality
New Auto-Interp
Negative Logits
EMS
-0.82
urers
-0.81
Amend
-0.71
LV
-0.71
Eat
-0.71
Dept
-0.68
earchers
-0.67
witz
-0.67
OOL
-0.67
×ķ
-0.67
POSITIVE LOGITS
ity
1.09
isexual
0.84
iliary
0.82
inals
0.79
icide
0.79
bisexual
0.77
iliated
0.76
dar
0.75
citiz
0.71
ikini
0.69
Activations Density 0.010%