INDEX
Explanations
possessive pronouns indicating ownership or association
pronouns related to possession
Negative Logits
ilial     -0.64
Mahm      -0.62
Sonia     -0.61
haus      -0.61
ared      -0.60
ogan      -0.59
bender    -0.59
hov       -0.59
Izan      -0.58
Liang     -0.58
Positive Logits
bearings  1.28
own       1.24
footing   1.02
selves    0.91
self      0.89
revenge   0.85
feet      0.84
elf       0.84
rightful  0.83
knees     0.80
Activations Density 0.106%