    Explanations

    references to LGBTQ-related topics

    references to the LGBTQ+ community, particularly concerning gay rights and issues

    Negative Logits
     Manufacturer          -0.74
    Condition              -0.71
     sidx                  -0.70
     guiActiveUnfocused    -0.70
    ufact                  -0.69
    è¦ļéĨĴ                 -0.69
    hower                  -0.69
    ç«                     -0.68
    Ct                     -0.66
    PsyNetMessage          -0.66
    Positive Logits
    atri                   1.01
    dar                    0.91
     couples               0.88
    lord                   0.87
     slurs                 0.85
     marriage              0.85
    bie                    0.82
    ened                   0.81
     rights                0.80
     bashing               0.80
    Activation Density: 0.022%

    No Known Activations