INDEX
Explanations
phrases related to holding individuals or groups accountable
references to specific events or actions related to advocacy and social issues
Negative Logits
Token            Logit
)=(              -0.62
Canaver          -0.59
etheless         -0.56
archived         -0.52
summed           -0.50
awaru            -0.49
historian        -0.49
Trace            -0.46
angled           -0.46
heterogeneity    -0.46
Positive Logits
Token            Logit
pers             0.62
vers             0.60
.",              0.59
adium            0.58
ankind           0.57
?",              0.56
verage           0.56
lees             0.55
nels             0.54
xia              0.54
Activations Density: 1.311%