INDEX
Explanations
sentences related to legal proceedings or imprisonment
references to prison sentences
New Auto-Interp
Negative Logits
sie
-1.00
BLIC
-0.88
NetMessage
-0.72
chio
-0.72
aucus
-0.71
sonian
-0.71
rous
-0.70
endor
-0.69
aucuses
-0.67
sylv
-0.66
POSITIVE LOGITS
sentences
1.11
sentence
0.98
probation
0.83
imposed
0.83
sentencing
0.82
ishment
0.78
summ
0.77
imprisonment
0.74
icts
0.74
harshly
0.73
Activations Density 0.022%