INDEX
Explanations
references to family and social relationships
New Auto-Interp
Negative Logits
Families
-0.54
families
-0.54
_family
-0.44
Familie
-0.38
Family
-0.38
Family
-0.37
FAMILY
-0.37
familial
-0.36
-family
-0.36
family
-0.36
POSITIVE LOGITS
pets
0.18
circle
0.18
friends
0.17
church
0.17
community
0.16
individuals
0.15
231
0.15
Circle
0.15
Individuals
0.15
extended
0.15
Activations Density 0.052%