INDEX
Explanations
phrases related to social justice and community empowerment
New Auto-Interp
Negative Logits
Omn
-0.17
acos
-0.15
osti
-0.15
var
-0.14
VEL
-0.14
vel
-0.14
bound
-0.14
anni
-0.14
IES
-0.14
/Private
-0.14
POSITIVE LOGITS
voices
0.20
owned
0.19
female
0.18
Owned
0.17
contributions
0.16
jÃŃ
0.16
voices
0.16
entirely
0.16
majority
0.15
unheard
0.15
Activations Density 0.204%