INDEX
Explanations
names of people and places, particularly those with unique diacritical marks or accents
New Auto-Interp
Negative Logits
lar
-0.30
ìķĺ
-0.26
ìķĺëĭ¤
-0.26
ca
-0.24
ban
-0.23
ça
-0.22
va
-0.21
dır
-0.20
ra
-0.20
ta
-0.20
POSITIVE LOGITS
zet
0.25
inde
0.19
де
0.19
ény
0.19
Åij
0.19
ye
0.19
iben
0.19
ÑĢе
0.18
etty
0.18
ben
0.18
Activations Density 0.008%