INDEX
Explanations
names of individuals or places
New Auto-Interp
Negative Logits
alu
-0.18
loquent
-0.17
ovnÃŃ
-0.17
alien
-0.17
овÑĸ
-0.16
alach
-0.15
ovation
-0.15
ignum
-0.15
ov
-0.15
wart
-0.15
POSITIVE LOGITS
sky
0.36
sk
0.35
ski
0.32
itch
0.28
SK
0.28
ksi
0.24
ichi
0.24
ici
0.23
icz
0.23
ych
0.23
Activations Density 0.026%