INDEX
Explanations
statements related to social justice and the responsibilities of citizens
Negative Logits
soDeliveryDate       -0.75
quickShipAvailable   -0.73
hole                 -0.70
Eps                  -0.66
Tarant               -0.66
rm                   -0.65
orc                  -0.62
script               -0.61
expired              -0.61
ipop                 -0.61
Positive Logits
understand           0.93
enjoy                0.91
abide                0.90
participate          0.90
beware               0.87
rejoice              0.84
Equality             0.82
educate              0.81
strive               0.80
obey                 0.79
Activations Density: 0.108%