INDEX
Explanations
references to historical figures and places
New Auto-Interp
Negative Logits
afil
-0.16
èŤ
-0.15
ussen
-0.15
iaux
-0.15
chl
-0.15
çŃĴ
-0.14
ادت
-0.14
chute
-0.14
μÏĨ
-0.14
borg
-0.14
POSITIVE LOGITS
convict
0.32
Governor
0.26
conv
0.23
Conv
0.23
178
0.22
conv
0.22
Conv
0.21
Colony
0.21
182
0.20
governor
0.20
Activations Density 0.030%