INDEX
Explanations
references to the location "jail" and activities related to it
references to jail or incarceration
New Auto-Interp
Negative Logits
omen
-0.77
igree
-0.74
atic
-0.73
drops
-0.71
Orig
-0.70
manship
-0.69
¼
-0.69
onom
-0.68
umen
-0.67
onomic
-0.64
POSITIVE LOGITS
jail
3.84
Jail
3.05
jails
2.75
prison
2.47
jailed
1.80
prisons
1.74
incarcer
1.70
Prison
1.69
prison
1.69
detention
1.69
Activations Density 0.016%