INDEX
Explanations
references to historical and modern slavery, particularly emphasizing the negative aspects and impact
references to the concept of slavery
Negative Logits
Token             Logit
erb               -0.81
Editors           -0.77
rons              -0.76
cit               -0.74
acl               -0.73
icles             -0.72
kj                -0.69
liners            -0.68
soDeliveryDate    -0.67
bits              -0.67
Positive Logits
Token             Logit
slavery           1.19
plantation        0.96
avery             0.90
enslaved          0.88
plantations       0.88
slaves            0.88
ensl              0.87
avement           0.83
slave             0.81
dehuman           0.77
Activations Density: 0.030%