Explanations
personal experiences and relationships
possessive pronouns and references to personal relationships
Negative Logits
ablishment    -0.76
vernment      -0.72
urden         -0.70
ilib          -0.70
ellation      -0.69
generated     -0.69
Ô             -0.68
redits        -0.68
apiece        -0.66
beit          -0.66
Positive Logits
girlfriend    1.41
roommate      1.39
boyfriend     1.37
mom           1.34
grandma       1.34
roomm         1.34
dad           1.33
niece         1.33
fiance        1.30
wife          1.30
Activations Density: 0.231%