INDEX
    Explanations

    themes related to social justice and collective experiences of marginalized communities

    New Auto-Interp
    Negative Logits
    CLU
    -0.16
    hev
    -0.15
     Graz
    -0.15
     ει
    -0.14
     manned
    -0.14
    zen
    -0.14
    ivil
    -0.14
    å¾®ç¬ij
    -0.13
    CSR
    -0.13
     Lisp
    -0.13
    POSITIVE LOGITS
     dec
    0.20
    -archive
    0.19
    archives
    0.17
     que
    0.17
     femme
    0.17
     Black
    0.17
    icolon
    0.17
    iglia
    0.16
    .archive
    0.16
    ipt
    0.16
    Act Density 0.009%

    No Known Activations