INDEX
Explanations
information related to family members and their relationships
mentions of familial relationships and dynamics
New Auto-Interp
Negative Logits
rium
-0.67
Magikarp
-0.63
verage
-0.60
rieg
-0.59
osphere
-0.58
aggregate
-0.57
takedown
-0.57
Vote
-0.57
.–
-0.56
Recommend
-0.56
POSITIVE LOGITS
boyfriend
0.90
married
0.90
daughters
0.88
married
0.88
granddaughter
0.86
daughter
0.85
nursing
0.83
estranged
0.81
grandmother
0.80
wheelchair
0.80
Activations Density 1.983%