INDEX
Explanations
references to possessive pronouns and related expressions of ownership
New Auto-Interp
Negative Logits
مُعرِّف
-0.62
Référence
-0.55
himself
-0.53
的他
-0.51
otides
-0.51
felf
-0.50
fiance
-0.50
ituary
-0.47
kendisi
-0.47
vible
-0.46
POSITIVE LOGITS
themselves
1.74
their
1.62
Their
1.57
themselves
1.51
their
1.51
Their
1.45
they
1.37
they
1.22
THEIR
1.22
彼らの
1.20
Activations Density 0.446%