INDEX
Explanations
references to parks and public spaces
New Auto-Interp
Negative Logits
bilt
-0.16
ÄĽle
-0.14
Gle
-0.14
iye
-0.14
ÑĤÑĶ
-0.14
Ñĩки
-0.13
оÑĩно
-0.13
Clem
-0.13
Nas
-0.13
Resort
-0.13
POSITIVE LOGITS
ningen
0.26
heten
0.22
ниÑĤе
0.17
elsen
0.16
gii
0.16
иÑĤе
0.16
tersebut
0.16
quer
0.15
aren
0.15
ului
0.15
Activations Density 0.054%