INDEX
Explanations
possessive pronouns followed by nouns
possessive pronouns referring to individuals
New Auto-Interp
Negative Logits
Byr
-0.65
Seym
-0.64
Edison
-0.64
—-
-0.62
å§
-0.62
Berk
-0.62
Barrett
-0.61
Frie
-0.61
----
-0.60
âĶĢ
-0.60
POSITIVE LOGITS
favourite
0.84
own
0.83
favorite
0.78
finest
0.72
bably
0.71
sqor
0.70
oldest
0.69
youngest
0.68
ELF
0.68
itage
0.68
Activations Density 0.103%