INDEX
Explanations
instances of legal sentencing and imprisonment
New Auto-Interp
Negative Logits
arresting
-0.16
Arrest
-0.16
uras
-0.16
arrests
-0.15
ardy
-0.15
ello
-0.14
arrest
-0.14
Swarm
-0.14
Marsh
-0.14
erton
-0.14
POSITIVE LOGITS
sentence
0.26
Sentence
0.22
sentenced
0.22
sentences
0.21
Sentence
0.21
sentence
0.20
life
0.19
Sent
0.17
(sentence
0.17
_sentence
0.16
Activations Density 0.062%