INDEX
Explanations
geographical directional terms and references to locations
New Auto-Interp
Negative Logits
West
-0.17
East
-0.17
obus
-0.15
Enumerator
-0.15
ÑĢаб
-0.14
North
-0.14
assin
-0.14
бе
-0.14
ophile
-0.14
.gray
-0.14
POSITIVE LOGITS
est
0.26
sud
0.25
nord
0.24
est
0.22
sudo
0.22
'est
0.21
’est
0.21
phÃŃa
0.20
estic
0.18
uest
0.18
Activations Density 0.033%