INDEX
Explanations
words related to sexual orientation and relationship status, particularly focusing on heterosexuality and monogamous relationships
references to heterosexuality and related concepts
New Auto-Interp
Negative Logits
former
-0.76
ht
-0.74
hod
-0.73
Downloadha
-0.71
abba
-0.69
chief
-0.67
adium
-0.67
Barkley
-0.67
hig
-0.66
umi
-0.66
POSITIVE LOGITS
nesday
1.01
monog
0.87
ity
0.84
eties
0.82
inant
0.80
couples
0.78
soever
0.77
ility
0.73
heterosexual
0.73
minded
0.72
Activations Density 0.023%