    Explanations

    terms related to different sexual orientations, focusing especially on the concept of being straight

    references to heterosexual identity and related discussions

    Negative Logits
    Token         Logit
    "覚醒"        -0.69
    " Kard"       -0.66
    "adle"        -0.66
    " Lauder"     -0.65
    "auder"       -0.65
    " Tycoon"     -0.64
    "mble"        -0.63
    " Oro"        -0.63
    " ILCS"       -0.62
    " Ji"         -0.62
    Positive Logits
    Token         Logit
    "bent"        0.95
    "ibur"        0.95
    "straight"    0.92
    "forward"     0.89
    "Stra"        0.86
    " Straight"   0.86
    "bread"       0.78
    "FIX"         0.78
    "away"        0.77
    " dope"       0.77
    Activation Density: 0.005%

    No Known Activations