INDEX
Explanations
information related to prison housing and inmate management
New Auto-Interp
Negative Logits
Son
-0.48
tiel
-0.47
Son
-0.46
чак
-0.44
Un
-0.44
spunk
-0.43
Na
-0.42
Grenada
-0.42
Na
-0.42
гат
-0.42
POSITIVE LOGITS
Houſe
0.89
Мексичка
0.86
houſe
0.86
pleaſure
0.85
Majefty
0.84
ſeveral
0.83
########.
0.82
ſtate
0.81
raiſ
0.78
extAlignment
0.77
Activations Density 0.442%