INDEX
Explanations
phrases related to negative judgment or prejudice towards a group of people
phrases related to facelessness or anonymity
New Auto-Interp
Negative Logits
Kaufman
-0.65
Vaugh
-0.60
Lyons
-0.59
wealthier
-0.59
Nicarag
-0.58
prospect
-0.58
Weir
-0.58
Shelby
-0.57
inexpensive
-0.57
DeV
-0.57
POSITIVE LOGITS
âĢ
1.43
âĢ
1.28
[/
1.26
âĶĤ
1.05
ãĢ
1.04
âľ
1.04
</
1.03
[/
1.03
É
1.02
âĺ
1.01
Activations Density 0.855%