Explanations
mentions related to the concept of "white" in a social or political context
references to race, specifically focusing on the concept of "white."
Negative Logits
Token           Logit
yrinth          -1.01
cffffcc         -0.93
ategory         -0.78
ysis            -0.76
gd              -0.76
alg             -0.76
HCR             -0.76
interstitial    -0.76
rocal           -0.74
Sym             -0.74
Positive Logits
Token           Logit
supremacist     1.48
supremacists    1.33
supremacy       1.21
nationalist     1.10
beard           0.99
bread           0.97
suprem          0.95
nationalists    0.93
males           0.85
face            0.84
Activations Density 0.036%