INDEX
Explanations
references to possession or ownership
New Auto-Interp
Negative Logits
ni
-0.17
aw
-0.15
@brief
-0.14
ampo
-0.14
akan
-0.14
yle
-0.14
anos
-0.13
ano
-0.13
ypsum
-0.13
awl
-0.13
POSITIVE LOGITS
rottle
0.16
uvo
0.15
meli
0.15
=&
0.15
ranÃŃ
0.14
.mods
0.14
'gc
0.14
kdir
0.14
NSE
0.14
933
0.14
Activations Density 0.166%