    Explanations

    concepts related to racial identity and social justice

    Negative Logits
    Token           Logit
    "YRO"           -0.17
    "fak"           -0.14
    "AFX"           -0.14
    "šov"           -0.14
    "еÑĩ"           -0.13
    "ilde"          -0.13
    "ìłĪ"           -0.13
    "Exporter"      -0.13
    "esin"          -0.13
    "Encryption"    -0.13
    Positive Logits
    Token           Logit
    " Diversity"    0.32
    " diversity"    0.32
    " equity"       0.31
    " Equity"       0.30
    " Bias"         0.29
    " unconscious"  0.27
    " ally"         0.27
    " race"         0.27
    "EDI"           0.26
    " racial"       0.26
    Activation Density: 0.128%

    No Known Activations