Explanations
phrases indicating prison sentences and duration of incarceration
sentenced to prison
Negative Logits
EndProject       -0.39
silueta          -0.32
rungsseite       -0.32
zzleHttp         -0.32
AppMethodBeat    -0.31
Relaciones       -0.31
referirse        -0.31
ofrecerte        -0.31
multicolumn      -0.30
Seen             -0.30
Positive Logits
jail             0.92
prison           0.88
jailed           0.86
imprisonment     0.84
Jail             0.79
jail             0.77
imprison         0.72
imprisoned       0.71
Prison           0.70
prisión          0.70
Activations Density: 0.029%
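
The density figure can be read as the share of tokens on which the feature fires. The sketch below is a minimal illustration under that assumption; the function name activation_density, the threshold parameter, and the sample values are hypothetical and not taken from the dashboard, which may compute the number differently.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a non-zero
    (above-threshold) feature activation. This is an illustrative
    definition, not the dashboard's confirmed method.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 tokens, of which 3 activate the feature.
acts = [0.0] * 9997 + [1.2, 0.4, 3.1]
density = activation_density(acts)
print(f"{density:.3%}")
```

A density this low would mean the feature is silent on nearly all tokens, which fits a feature tied to a narrow topic such as prison sentences.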