Explanations
mention of sexual orientations, specifically individuals who identify as bisexual
references to bisexuality
Negative Logits
Token       Logit
EMS         -0.88
LV          -0.81
Eat         -0.79
keeper      -0.74
hig         -0.73
XT          -0.70
DoS         -0.70
steen       -0.68
Downloadha  -0.68
books       -0.67
Positive Logits
Token       Logit
ity         1.38
ities       1.00
couples     0.90
iated       0.89
iliary      0.82
volent      0.79
males       0.77
inals       0.77
iliated     0.75
osexual     0.75
Activations Density: 0.030%