INDEX
Explanations
topics related to criminal justice and release from incarceration
New Auto-Interp
Negative Logits
UME
-0.14
Tome
-0.14
inne
-0.14
amaged
-0.14
resc
-0.14
udent
-0.14
ume
-0.14
_DF
-0.13
umes
-0.13
kea
-0.13
POSITIVE LOGITS
prison
0.29
sentenced
0.25
sentence
0.25
jail
0.24
parole
0.22
arcer
0.22
serving
0.21
sentence
0.21
incarcerated
0.20
release
0.20
Activations Density 0.171%