INDEX
Explanations
family relationships and details about family members
mentions of family relationships and connections
New Auto-Interp
Negative Logits
oster
-0.74
ocally
-0.62
ost
-0.61
izophren
-0.61
ilipp
-0.61
committee
-0.60
etting
-0.60
avorable
-0.60
Critics
-0.59
ocations
-0.59
POSITIVE LOGITS
whom
0.99
Jr
0.94
eldest
0.92
who
0.90
daughter
0.85
Jr
0.84
Sr
0.84
daughter
0.82
married
0.82
aka
0.80
Activations Density 0.196%