INDEX
Explanations
words related to prisons and incarceration
New Auto-Interp
Negative Logits
£ı
-0.70
ãĥ£
-0.70
thora
-0.70
yip
-0.68
endor
-0.66
OPA
-0.65
rians
-0.64
soDeliveryDate
-0.63
cs
-0.62
sonian
-0.61
POSITIVE LOGITS
inmates
1.20
inmate
1.11
confinement
1.06
prisoners
1.02
prison
1.01
sentences
0.96
prison
0.95
jails
0.93
prisons
0.92
convict
0.90
Activations Density 0.843%