INDEX
Explanations
phrases related to political activism and social justice movements
phrases indicating demands or actions related to justice and opposition
New Auto-Interp
Negative Logits
ograp
-0.78
âĨij
-0.74
oji
-0.68
availability
-0.65
ocry
-0.61
orem
-0.61
veh
-0.61
Enlarge
-0.61
oultry
-0.61
Asked
-0.59
POSITIVE LOGITS
enance
1.11
such
1.07
this
1.02
these
1.02
injustice
0.99
THIS
0.90
blindly
0.89
theirs
0.89
anything
0.89
blatant
0.88
Activations Density 0.492%