INDEX
    Explanations

    mentions of the word "Stonewall" in various contexts, particularly relating to LGBTQ+ history and the Stonewall riots

    New Auto-Interp
    Negative Logits
    loid
    -0.15
    aux
    -0.15
    ukan
    -0.15
    zan
    -0.15
    ÑĩÑĥ
    -0.15
    CR
    -0.14
     Williamson
    -0.14
    onde
    -0.14
    ÑģÑĤа
    -0.14
    imized
    -0.14
    POSITIVE LOGITS
    eware
    0.16
    warts
    0.16
    edef
    0.16
    essel
    0.15
     Äijá»ĭnh
    0.15
    eliness
    0.15
    561
    0.15
    GRES
    0.15
    ultan
    0.15
    ợ
    0.15
    Act Density 0.013%

    No Known Activations