INDEX
Explanations
pronouns and possessive pronouns indicating ownership or relationship
possessive pronouns indicating ownership or belonging
Negative Logits
bender          -0.79
eers            -0.75
avis            -0.70
BUG             -0.70
Frazier         -0.70
Difference      -0.69
Unsure          -0.67
CENT            -0.66
igh             -0.65
inctions        -0.65
Positive Logits
own             1.97
predecessors    1.26
respective      1.23
predecessor     1.15
peers           1.14
knees           1.13
Own             1.10
contemporaries  1.09
namesake        1.08
hometown        1.07
Activations Density: 0.351%