INDEX
    Explanations

    terms related to racial identity, particularly focusing on Black and minority populations

    New Auto-Interp
    Negative Logits
    otyping
    -0.17
    acman
    -0.16
     Borough
    -0.16
    ORIZ
    -0.15
    евид
    -0.15
     Sexe
    -0.14
    nown
    -0.14
    cribe
    -0.14
    ÑĢÑĥÑĤ
    -0.14
    pii
    -0.14
    POSITIVE LOGITS
    -owned
    0.22
    -Owned
    0.19
     Lives
    0.17
    listed
    0.16
     Led
    0.16
    ness
    0.16
    /A
    0.16
    ailed
    0.15
    Led
    0.15
    -led
    0.15
    Act Density 0.032%

    No Known Activations