INDEX
Explanations
a variety of proper names and titles associated with individuals and roles in a community or institution
New Auto-Interp
Negative Logits
öy
-0.15
ubby
-0.15
話
-0.15
ÄĽj
-0.15
imenti
-0.15
stown
-0.15
ead
-0.14
stub
-0.14
ordable
-0.14
cape
-0.14
POSITIVE LOGITS
wal
0.23
odia
0.21
olia
0.17
izada
0.16
oria
0.16
ÑħÑĥ
0.15
Roy
0.15
urve
0.15
hani
0.15
aria
0.15
Activations Density 0.129%