INDEX
Explanations
phrases related to sentencing and judicial consequences
New Auto-Interp
Head Attr Weights
0:0.01
1:0.01
2:0.07
3:0.06
4:0.06
5:0.02
6:0.04
7:0.44
8:0.02
9:0.03
10:0.10
11:0.11
Negative Logits
soType
-1.51
Collider
-1.46
)</
-1.43
IRT
-1.41
nergy
-1.39
idem
-1.38
similarities
-1.37
alogy
-1.37
Forums
-1.36
ahoo
-1.29
POSITIVE LOGITS
manslaughter
1.82
incarcer
1.62
jailed
1.58
outstanding
1.55
imprisonment
1.53
charges
1.52
sentences
1.50
rapes
1.46
penalties
1.45
arrest
1.43
Activations Density 0.015%