INDEX
Explanations
references to possessive forms, specifically the word "its"
Negative Logits
by      -0.63
er      -0.60
ity     -0.59
to      -0.57
the     -0.55
for     -0.55
had     -0.54
is      -0.51
To      -0.51
-       -0.50
Positive Logits
its             1.87
Its             1.42
Its             1.36
Autoritní       1.21
NameInMap       1.20
它的            1.15
Савезне         1.08
дописавши       1.08
betweenstory    1.01
expandindo      1.01
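The two logit lists above read as the extremes of a per-token score. The sketch below shows one way such lists could be extracted, assuming a hypothetical list of (token, logit) pairs. The function name `top_and_bottom` is illustrative and not taken from any tool; the sample values are copied from the listing above.

```python
from typing import List, Tuple

Pair = Tuple[str, float]


def top_and_bottom(pairs: List[Pair], k: int) -> Tuple[List[Pair], List[Pair]]:
    """Return the k most positive and the k most negative (token, logit) pairs.

    Hypothetical helper for illustration only.
    """
    # Sort once from highest to lowest logit.
    ranked = sorted(pairs, key=lambda p: p[1], reverse=True)
    positive = ranked[:k]
    # Reverse the ranking so the most negative pair comes first.
    negative = ranked[::-1][:k]
    return positive, negative


# A few rows copied from the listing above.
sample = [
    ("its", 1.87),
    ("Its", 1.42),
    ("它的", 1.15),
    ("the", -0.55),
    ("er", -0.60),
    ("by", -0.63),
]
positive, negative = top_and_bottom(sample, 2)
```

A list of pairs is used rather than a dictionary because the same surface token can appear more than once, as "Its" does in the positive list.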
Activations Density 0.119%
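The page does not define the density figure. A minimal sketch, assuming it is the fraction of token positions on which the feature's activation is above zero; `activation_density` and the sample data are hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumed definition; the source page does not state one.
    """
    if not activations:
        raise ValueError("need at least one activation")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: the feature fires on 119 of 100,000 tokens.
acts = [0.0] * 99_881 + [2.5] * 119
density = activation_density(acts)
print(f"{density:.3%}")  # 0.119%
```

Under this assumption, a density of 0.119% corresponds to roughly one active token in every 840.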