INDEX
Explanations
pronouns indicating possession and the following noun
possessive pronouns and their associated subjects in sentences
New Auto-Interp
Negative Logits
ibaba
-0.74
Vog
-0.68
unker
-0.65
igslist
-0.65
Females
-0.62
uded
-0.61
Texture
-0.61
llular
-0.60
Defin
-0.60
housed
-0.59
POSITIVE LOGITS
own
1.44
constituents
1.11
colleague
1.08
predecessors
1.05
colleagues
1.01
ocard
1.01
superiors
1.00
predecessor
0.97
daughter
0.95
clients
0.94
Activations Density 0.272%