INDEX
    Explanations

    themes related to systemic issues and social justice

    New Auto-Interp
    Negative Logits
    pector
    -0.17
    aggio
    -0.17
    undy
    -0.16
    ureau
    -0.16
    ovol
    -0.16
    mercial
    -0.15
    rál
    -0.14
    atsu
    -0.14
    ARAM
    -0.14
    jo
    -0.14
    POSITIVE LOGITS
    оÑĢаз
    0.17
    kat
    0.15
    ething
    0.15
    nder
    0.15
    ¿
    0.14
    izzie
    0.14
    uplic
    0.13
    acher
    0.13
    ikt
    0.13
    Ĵ
    0.13
    Act Density 0.467%

    No Known Activations