Explanations
possessive forms of nouns indicating ownership or relation
Negative Logits
baugh    -0.15
adlo     -0.15
urs      -0.15
olley    -0.15
лат      -0.15
utto     -0.14
uit      -0.14
hip      -0.14
cepts    -0.14
ets      -0.14
Positive Logits
样子     0.15
own      0.15
/-       0.14
ains     0.14
_OW      0.13
плеч     0.13
="__     0.13
phinx    0.13
Own      0.13
omap     0.13
Activations Density 0.076%