INDEX
Explanations
information related to literature and publication details
Negative Logits
ĨĴ        -0.17
angan     -0.14
atura     -0.14
.mvp      -0.14
plural    -0.13
ochen     -0.13
ĸī        -0.13
erva      -0.13
abel      -0.13
poss      -0.13
Positive Logits
istr       0.16
éf         0.15
WEEN       0.15
jamin      0.15
.gdx       0.14
inis       0.14
otherwise  0.14
levard     0.14
ethoven    0.14
ween       0.13
Activations Density 8.777%