INDEX
Explanations
references to the historical context of slavery and its implications
New Auto-Interp
Negative Logits
MLElement
-0.61
COUVER
-0.61
campista
-0.60
//};
-0.59
vertes
-0.58
Trish
-0.57
onOptions
-0.56
entance
-0.56
יצוני
-0.56
chtenstein
-0.56
POSITIVE LOGITS
expandindo
0.68
slavery
0.63
apartheid
0.61
racially
0.61
//
0.61
negro
0.56
racism
0.56
slave
0.56
Racism
0.54
racial
0.53
Activations Density 0.300%