INDEX
Explanations
references to locations or entities with 'White' in their name
mentions of the word "White" in relation to various contexts
Negative Logits
REM           -0.83
cffffcc       -0.79
ANS           -0.78
Downloadha    -0.75
ITAL          -0.74
udeau         -0.73
olls          -0.71
vati          -0.69
ript          -0.68
APH           -0.67
Positive Logits
Sox           1.14
caps          1.13
horse         1.10
hall          1.10
supremacist   1.05
supremacists  1.02
beard         1.01
house         0.97
bread         0.95
Sands         0.93
Activations Density: 0.021%