INDEX
    Explanations

    references to LGBTQ+ identity and representation

    New Auto-Interp
    Negative Logits
    anken
    -0.14
    chantment
    -0.13
    mlink
    -0.13
    odial
    -0.13
     nutrient
    -0.12
    ullan
    -0.12
    ossa
    -0.12
    岡
    -0.12
    aurant
    -0.12
    uang
    -0.12
    POSITIVE LOGITS
     LGBT
    0.68
     LGBTQ
    0.66
     gay
    0.66
     queer
    0.60
     lesbian
    0.60
     Lesbian
    0.59
     homosexual
    0.58
     Gay
    0.57
     gays
    0.56
    gay
    0.56
    Act Density 0.527%

    No Known Activations