INDEX
Explanations
references to civil rights and social justice struggles
Negative Logits
Token     Logit
ozo       -0.16
amax      -0.15
.fm       -0.15
alist     -0.15
ihan      -0.14
IRECT     -0.14
dio       -0.14
ski       -0.14
DEPEND    -0.14
stab      -0.14
Positive Logits
Token       Logit
civil       0.22
196         0.21
ML          0.21
Civil       0.20
Martin      0.20
Martin      0.18
African     0.17
Southern    0.17
race        0.17
southern    0.16
Activations Density: 0.108%