INDEX
    Explanations

    mentions of the color white within social and political contexts

    instances of the word "white" in various contexts

    New Auto-Interp
    Negative Logits
    =-=-=-=-
    -0.86
    yrinth
    -0.82
    SIGN
    -0.74
    Completed
    -0.73
    REC
    -0.72
     Inspect
    -0.72
    ATOR
    -0.70
    obbies
    -0.69
    itual
    -0.69
    rocal
    -0.69
    POSITIVE LOGITS
     supremacist
    1.25
     supremacists
    1.12
     suprem
    1.01
     white
    1.00
    lucent
    1.00
     nationalist
    0.94
    white
    0.88
     violet
    0.86
     supremacy
    0.86
     elephant
    0.83
    Act Density 0.018%

    No Known Activations