INDEX
Explanations
names of authors and notable historical figures
New Auto-Interp
Negative Logits
.Selenium
-0.18
знаком
-0.17
ragaz
-0.17
otos
-0.16
iná
-0.16
igner
-0.16
aeda
-0.15
öm
-0.15
ash
-0.14
aka
-0.14
POSITIVE LOGITS
þ
0.20
Crist
0.20
þ
0.18
Exped
0.18
clerk
0.17
hadde
0.16
werk
0.16
awns
0.15
weren
0.15
noon
0.15
Activations Density 0.002%