INDEX
    Explanations

    discussions around LGBTQ+ rights and related events

    New Auto-Interp
    Negative Logits
    iggers
    -0.18
    igger
    -0.16
    ieri
    -0.15
    å©
    -0.15
    aise
    -0.15
    zend
    -0.14
    æ®ĸ
    -0.14
    etta
    -0.14
    AIT
    -0.14
     Inn
    -0.14
    POSITIVE LOGITS
    Ïīδ
    0.17
    анÑĤи
    0.17
    еÑĢÑĪ
    0.16
    opak
    0.15
    onte
    0.15
     Sad
    0.14
    _builtin
    0.14
     hung
    0.13
    imbus
    0.13
    Sad
    0.13
    Act Density 0.007%

    No Known Activations