INDEX
    Explanations

    references to LGBTQ+ identities and related discussions

    New Auto-Interp
    Negative Logits
    ynamo
    -0.20
    Ñĥй
    -0.17
    pray
    -0.16
    oggler
    -0.16
     Spray
    -0.16
    tah
    -0.15
    inar
    -0.15
    ëı
    -0.15
    odel
    -0.14
     hydr
    -0.14
    POSITIVE LOGITS
    Immutable
    0.17
     Passive
    0.17
     Mand
    0.16
     duck
    0.16
    é¡į
    0.16
    Sy
    0.16
     Hemp
    0.16
     ducks
    0.15
     Sy
    0.15
     Sydney
    0.15
    Act Density 0.032%

    No Known Activations