    Explanations

    terms related to the LGBT community

    references to the LGBTQ community and related topics

    references to the LGBTQ+ community and issues related to discrimination

    Negative Logits
    Token        Logit
    "mington"    -0.76
    "ither"      -0.74
    "fulness"    -0.71
    "lio"        -0.71
    "lessly"     -0.68
    " crore"     -0.66
    "ded"        -0.65
    "ding"       -0.63
    "ibble"      -0.63
    " Saving"    -0.62

    Positive Logits
    Token        Logit
    "ugal"       0.95
    "sect"       0.79
    "ynski"      0.79
    "ère"        0.74
    "TY"         0.74
    "alyst"      0.74
    "ンジ"       0.73
    "Q"          0.72
    "TI"         0.70
    " Strauss"   0.70
    Activation Density: 0.028%

    No Known Activations