INDEX
Explanations
mentions of the color white within social and political contexts
instances of the word "white" in various contexts
Negative Logits
=-=-=-=-      -0.86
yrinth        -0.82
SIGN          -0.74
Completed     -0.73
REC           -0.72
Inspect       -0.72
ATOR          -0.70
obbies        -0.69
itual         -0.69
rocal         -0.69
Positive Logits
supremacist    1.25
supremacists   1.12
suprem         1.01
white          1.00
lucent         1.00
nationalist    0.94
white          0.88
violet         0.86
supremacy      0.86
elephant       0.83
Activations Density: 0.018%