INDEX
    Explanations

    phrases and discussions around race and cultural identity

    New Auto-Interp
    Negative Logits
    eson
    -0.08
    erson
    -0.08
    roz
    -0.07
    essen
    -0.07
    leich
    -0.07
    261
    -0.07
    asil
    -0.07
    etch
    -0.07
    iros
    -0.07
     UsersController
    -0.07
    POSITIVE LOGITS
    /black
    0.08
    led
    0.07
    issant
    0.06
    LED
    0.06
     who
    0.06
    å´İ
    0.06
    /native
    0.06
    /class
    0.06
    PN
    0.05
    877
    0.05
    Act Density 0.008%

    No Known Activations