INDEX
Explanations
personal pronouns indicating possession or relationship
first-person pronouns and their associated verbs
Negative Logits
Token        Logit
amaz         -0.76
Barron       -0.72
ourgeois     -0.70
hatt         -0.69
geries       -0.66
Revolution   -0.65
yss          -0.65
Ukrain       -0.63
ategory      -0.62
inctions     -0.61
Positive Logits
Token        Logit
deems        0.97
deemed       0.94
cherish      0.89
sorely       0.86
deem         0.86
hadn         0.81
dearly       0.81
cannot       0.80
couldn       0.78
knew         0.78
Activations Density 0.169%
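
As a rough illustration of how a density figure like the one above can be read, the sketch below assumes that activation density means the share of sampled tokens on which the feature fires at all (activation above zero). That definition, the function name `activation_density`, and the sample values are assumptions for illustration only; they are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the fraction of sampled tokens on which the
    feature is active, expressed as a percentage.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical per-token activations; real dashboards sample far more tokens.
sample = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.0, 0.0]
print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.169% would mean the feature is active on roughly 17 of every 10,000 sampled tokens.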