INDEX
Explanations
references to family activities and relationships
Negative Logits
Token       Logit
ctors       -0.46
diese       -0.43
einzelne    -0.43
Planeten    -0.41
individual  -0.41
Users       -0.41
ones        -0.40
örös        -0.40
dieses      -0.40
Users       -0.40
Positive Logits
Token       Logit
hubby       1.25
dad         1.20
daughter    1.20
Dad         1.17
Daughter    1.11
wife        1.10
mom         1.10
daughter    1.10
husband     1.07
Mom         1.06
Activations Density: 0.457%