INDEX
Explanations
references to prison and criminal activity
New Auto-Interp
Negative Logits
arin
-0.16
ุà¹Ī
-0.15
á»Ń
-0.15
Foreign
-0.14
anners
-0.14
lico
-0.14
aar
-0.14
foreign
-0.14
Foreign
-0.14
Ïħνα
-0.14
POSITIVE LOGITS
prison
0.54
inmate
0.48
prisoner
0.47
Prison
0.46
prisoners
0.45
inmates
0.45
prisons
0.41
jail
0.37
incarcerated
0.33
Corrections
0.33
Activations Density 0.173%