INDEX
Explanations
articles and possessive pronouns in the text
New Auto-Interp
Negative Logits
_INCLUDED
-0.16
endra
-0.14
uada
-0.14
.č↵
-0.14
loquent
-0.14
wand
-0.14
uvian
-0.14
.Args
-0.13
.Dispatch
-0.13
------+------+
-0.13
POSITIVE LOGITS
rim
0.15
ifo
0.14
ActiveForm
0.14
ÏĮγ
0.14
ring
0.14
Marion
0.14
Forward
0.14
Īĺ
0.14
iden
0.14
ialis
0.14
Activations Density 0.113%