INDEX
Explanations
personal possessive pronouns and verbs indicating possession
pronouns and references to people in various contexts
Negative Logits
裏覚醒        -0.67
departures    -0.66
ascular       -0.66
========      -0.66
intervening   -0.65
lihood        -0.65
jri           -0.64
ETHOD         -0.64
DonaldTrump   -0.63
crit          -0.63
Positive Logits
wore          1.11
stole         1.08
bought        1.06
purchased     1.03
invented      0.97
crafted       0.91
borrowed      0.89
fashioned     0.88
inherited     0.88
planted       0.85
Activations Density 0.179%
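
For readers unfamiliar with the metric, activation density is usually read as the fraction of tokens on which the feature fires at all. The following is a minimal sketch under that assumption (the source does not state the definition). The activation values are made up for illustration, and `activation_density` is a hypothetical helper, not part of any dashboard API:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with a non-zero
    activation. The dashboard's exact definition is not given in the source.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Illustrative data only: 10,000 tokens, with the feature firing on 18 of them.
sample = [0.0] * 9982 + [1.5] * 18
density = activation_density(sample)
print(f"{density:.3%}")
```

A density well under 1%, like the one reported here, would mean the feature is silent on the vast majority of tokens and fires only in a narrow set of contexts.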