INDEX
Explanations
pronouns and possessive forms indicating personal experiences or ownership
Negative Logits
xl
-0.07
-0.07
as
-0.06
Arb
-0.06
rq
-0.06
ektor
-0.06
ovat
-0.05
apt
-0.05
utschein
-0.05
Lager
-0.05
Positive Logits
二人
0.08
ンピ
0.07
fcn
0.07
kara
0.07
.hd
0.07
руч
0.07
号
0.07
坦
0.07
styleType
0.07
isposable
0.07
Activations Density 0.013%
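
For readers unfamiliar with the density figure, the sketch below shows one common way such a number could be computed: the fraction of token positions on which the feature's activation is above zero. This is an assumption about what "Activations Density" means here, not a description of how this page derived it; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition: density = (# active positions) / (# positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 13 active positions out of 100,000 tokens.
sample = [0.0] * 100_000
for i in range(13):
    sample[i * 7000] = 1.5  # arbitrary nonzero activation value

density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.013% would mean the feature fires on roughly 13 of every 100,000 tokens, which is consistent with a sparse feature.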