INDEX
Explanations
names of family members and relationships
New Auto-Interp
Negative Logits
iliation
-0.17
eba
-0.16
alice
-0.16
Made
-0.16
że
-0.16
Meredith
-0.15
Made
-0.15
enne
-0.15
ixe
-0.15
Anne
-0.15
POSITIVE LOGITS
Heath
0.17
Levi
0.16
Dust
0.16
Dakota
0.16
Jason
0.16
Aaron
0.15
Brand
0.15
Brandon
0.15
Core
0.15
Corey
0.15
Activations Density 0.116%