INDEX
Explanations
words related to various forms and aspects of oppression
terms related to oppression and marginalized groups
Negative Logits
Token         Logit
WAYS          -0.77
tein          -0.70
cigarettes    -0.70
cephal        -0.68
Dub           -0.67
PET           -0.67
Lew           -0.65
uchin         -0.65
SEE           -0.64
sal           -0.64
Positive Logits
Token         Logit
oppression    1.04
oppress       1.03
oppressed     0.88
disadvant     0.82
retched       0.79
Struggle      0.76
injust        0.74
Palest        0.74
oppressive    0.72
injustice     0.72
Activations Density: 0.016%