INDEX
Explanations
mentions of human rights organizations and policies
references to organizations or concepts related to human rights
Negative Logits
Token       Logit
Copyright   -0.59
Flavoring   -0.56
inas        -0.54
taboola     -0.54
iership     -0.53
DCS         -0.52
UFC         -0.51
Scroll      -0.51
Published   -0.51
south       -0.50
Positive Logits
Token         Logit
moreover      0.89
however       0.89
therefore     0.88
furthermore   0.81
meanwhile     0.79
also          0.70
anwhile       0.69
additionally  0.66
thus          0.66
though        0.60
Activations Density: 2.742%