INDEX
Explanations
locations and names of places
New Auto-Interp
Negative Logits
/resource
-0.15
pers
-0.15
Ñĵ
-0.15
elan
-0.14
Baltimore
-0.14
TForm
-0.14
asta
-0.14
-Sah
-0.13
hete
-0.13
anka
-0.13
POSITIVE LOGITS
Berkshire
0.33
Windsor
0.28
Maiden
0.25
Asc
0.23
Thames
0.22
Reading
0.21
Asc
0.20
Sind
0.20
Sl
0.19
Tile
0.19
Activations Density 0.022%