INDEX
Explanations
phrases related to human rights abuses and brutality
references to human rights violations and related terms
Negative Logits
paren     -0.80
onym      -0.77
ramer     -0.76
ership    -0.73
ellipt    -0.73
aran      -0.72
adult     -0.71
ellar     -0.71
ode       -0.70
arten     -0.70
Positive Logits
atrocities     0.95
brutality      0.89
abuses         0.87
inflicted      0.84
perpetrated    0.84
massacres      0.81
dehuman        0.79
massac         0.79
killings       0.73
injustice      0.73
Activations Density: 0.064%