INDEX
Explanations
possessive pronouns followed by specific nouns
possessive pronouns referring to individuals or groups
New Auto-Interp
Negative Logits
aunder
-0.80
obin
-0.78
Balt
-0.77
lehem
-0.77
ibaba
-0.76
ulhu
-0.75
:{-0.75
uminati
-0.74
Tang
-0.74
yang
-0.74
POSITIVE LOGITS
ancestors
1.16
superiors
1.15
opponents
1.11
chances
1.09
own
1.08
fingerprints
1.07
predecessors
1.06
adversaries
1.03
parents
1.02
detractors
1.01
Activations Density 0.294%