INDEX
Explanations
proper nouns, specifically place names and locations
New Auto-Interp
Negative Logits
-League
-0.15
STAT
-0.14
umbo
-0.14
NO
-0.14
nid
-0.14
ãģªãģĦ
-0.14
Bronx
-0.14
ема
-0.13
ghan
-0.13
ÑĸÑģ
-0.13
POSITIVE LOGITS
-based
0.31
-area
0.25
ensis
0.22
based
0.20
region
0.19
native
0.19
-native
0.19
-Based
0.19
shire
0.18
area
0.18
Activations Density 0.287%