INDEX
Explanations
mentions of different sexual orientations, especially bisexuality, and related discussions or misconceptions
references to bisexuality and related sexual identities
New Auto-Interp
Negative Logits
frames
-0.83
arnaev
-0.82
atche
-0.80
urers
-0.76
alez
-0.75
assic
-0.73
ammy
-0.71
conom
-0.70
iago
-0.69
Dispatch
-0.69
POSITIVE LOGITS
osexual
1.10
ity
1.07
bisexual
0.90
couples
0.89
bian
0.85
Spectrum
0.83
sexuality
0.82
spectrum
0.80
bians
0.79
isexual
0.79
Activations Density 0.036%