INDEX
Explanations
terms related to different sexual orientations, focusing especially on the concept of being straight
references to heterosexual identity and related discussions
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.69
Kard
-0.66
adle
-0.66
Lauder
-0.65
auder
-0.65
Tycoon
-0.64
mble
-0.63
Oro
-0.63
ILCS
-0.62
Ji
-0.62
POSITIVE LOGITS
bent
0.95
ibur
0.95
straight
0.92
forward
0.89
Stra
0.86
Straight
0.86
bread
0.78
FIX
0.78
away
0.77
dope
0.77
Activations Density 0.005%