INDEX
Explanations
possessive `'s` followed by nouns
New Auto-Interp
Negative Logits
the
0.33
The
0.29
the
0.28
If
0.24
ulates
0.24
</h2>
0.23
eteries
0.22
>
0.22
:"
0.22
Both
0.22
POSITIVE LOGITS
own
0.40
eigene
0.26
собственные
0.25
own
0.25
propia
0.24
kendi
0.23
raincoat
0.23
prerogative
0.23
proverbial
0.23
foray
0.23
Activations Density 0.077%