INDEX
Explanations
phrases related to societal norms and biases, especially institutionalized ones
topics related to institutionalized biases and societal norms
New Auto-Interp
Negative Logits
endment
-0.92
bernatorial
-0.87
etsk
-0.84
earch
-0.80
Gerr
-0.78
hire
-0.76
shall
-0.76
eday
-0.75
éĹ
-0.74
leased
-0.74
POSITIVE LOGITS
stereotypes
1.62
prejudices
1.57
stereotype
1.53
precon
1.48
patriarchal
1.46
dehuman
1.43
ingrained
1.43
prejudice
1.41
stigma
1.41
subconscious
1.41
Activations Density 0.662%