INDEX
Explanations
references to prisons and corrections
Negative Logits
utenberg          -0.15
Ferrari           -0.14
BootApplication   -0.14
URA               -0.14
Ember             -0.14
Automobile        -0.14
franch            -0.13
elegance          -0.13
.removeChild      -0.13
poll              -0.13
Positive Logits
inmate            0.34
inmates           0.31
prisoner          0.30
prisoners         0.28
prison            0.27
cell              0.26
guards            0.25
Prison            0.25
yard              0.23
lockdown          0.22
Activations Density: 0.028%