INDEX
Explanations
phrases related to sentencing and criminal justice outcomes
New Auto-Interp
Negative Logits
Intr
-0.14
arf
-0.14
827
-0.14
amnesty
-0.14
Gilbert
-0.14
_sz
-0.14
instrumentation
-0.14
اÙĦبØŃر
-0.14
impunity
-0.14
orney
-0.13
POSITIVE LOGITS
sentence
0.20
sentences
0.17
Sentence
0.16
ikon
0.16
sentence
0.16
deal
0.16
á»ģn
0.15
ught
0.15
ventus
0.15
SHARE
0.14
Activations Density 0.029%