INDEX
Explanations
words related to geographical descriptions, particularly those denoting locations or regions
Negative Logits
indi          -0.15
rsp           -0.14
stown         -0.14
underlying    -0.14
cott          -0.14
รà¸ĩ           -0.14
ã쮿ĸ¹          -0.13
Marcel        -0.13
substr        -0.13
kad           -0.13
Positive Logits
ern           0.26
erne          0.23
outer         0.22
ternal        0.21
tern          0.21
erna          0.20
eri           0.20
erno          0.20
erie          0.20
erior         0.19
Activations Density 0.005%