INDEX
Explanations
words related to social issues and activism
discourse surrounding criticism and accountability in social movements
New Auto-Interp
Negative Logits
earchers
-0.96
Zurich
-0.77
refuel
-0.74
arbon
-0.74
successor
-0.73
forecast
-0.73
prepar
-0.72
cellar
-0.70
ircraft
-0.70
hiba
-0.69
POSITIVE LOGITS
misogyny
1.64
misogyn
1.64
feminists
1.62
Feminist
1.55
feminism
1.55
Femin
1.53
shaming
1.52
sexist
1.50
slurs
1.49
femin
1.47
Activations Density 1.271%