INDEX
    Explanations

    topics related to gun control and LGBTQ+ rights

    New Auto-Interp
    Negative Logits
    izer
    -0.28
    ized
    -0.26
    ization
    -0.25
    eer
    -0.24
    ously
    -0.24
    izers
    -0.24
    ize
    -0.23
    naire
    -0.22
    hip
    -0.20
    aires
    -0.20
    POSITIVE LOGITS
    ãĢħ
    0.20
    ery
    0.18
    istry
    0.17
    shot
    0.17
    //{{
    0.17
    iness
    0.17
    yb
    0.17
    tober
    0.16
    rey
    0.16
    linger
    0.16
    Act Density 0.638%

    No Known Activations