INDEX
Explanations
possessive pronouns indicating personal ownership or relationships
New Auto-Interp
Negative Logits
ero
-0.17
aidu
-0.16
isode
-0.15
cene
-0.15
iset
-0.15
.syntax
-0.14
urette
-0.14
iosa
-0.14
ssp
-0.14
ifax
-0.14
POSITIVE LOGITS
own
0.17
behalf
0.16
spare
0.15
arsenal
0.15
downtime
0.15
travels
0.14
ESA
0.14
/feed
0.14
stead
0.14
0.14
Activations Density 0.212%