INDEX
Explanations
pronouns and possessive forms indicating relationships and ownership
New Auto-Interp
Negative Logits
atisch
-0.16
venes
-0.16
ogue
-0.16
-mf
-0.16
gre
-0.15
untos
-0.14
å¹²
-0.14
cete
-0.14
sobie
-0.14
illes
-0.14
POSITIVE LOGITS
etz
0.20
osu
0.15
Mayo
0.15
éħ
0.15
ARRIER
0.15
ูม
0.14
outward
0.14
sip
0.14
баÑĩ
0.14
lied
0.14
Activations Density 0.008%