INDEX
Explanations
references to major cities and tourist attractions
New Auto-Interp
Negative Logits
ToFront
-0.17
kenin
-0.17
atte
-0.15
ãĥ©ãĥ¼
-0.15
tir
-0.15
глÑı
-0.14
.ajax
-0.14
èĭ
-0.14
kova
-0.13
lect
-0.13
POSITIVE LOGITS
atican
0.16
Lumpur
0.15
Lug
0.14
Haupt
0.13
uki
0.13
angi
0.13
Assistant
0.13
ÌĨ
0.13
Testament
0.13
oky
0.13
Activations Density 0.060%