INDEX
    Explanations

    mentions related to the concept of "white" in a social or political context

    references to race, specifically focusing on the concept of "white."

    NEGATIVE LOGITS
    Token             Logit
    "yrinth"          -1.01
    "cffffcc"         -0.93
    "ategory"         -0.78
    "ysis"            -0.76
    "gd"              -0.76
    "alg"             -0.76
    "HCR"             -0.76
    "interstitial"    -0.76
    "rocal"           -0.74
    "Sym"             -0.74
    POSITIVE LOGITS
    Token             Logit
    " supremacist"    1.48
    " supremacists"   1.33
    " supremacy"      1.21
    " nationalist"    1.10
    "beard"           0.99
    "bread"           0.97
    " suprem"         0.95
    " nationalists"   0.93
    " males"          0.85
    "face"            0.84
    Act Density 0.036%

    No Known Activations