INDEX
Explanations
family-related terms, particularly focusing on grandmothers
references to grandmothers or related familial terms
New Auto-Interp
Negative Logits
axis
-0.80
oned
-0.73
etheus
-0.72
gged
-0.70
yss
-0.69
contiguous
-0.65
oning
-0.65
assic
-0.65
ownt
-0.65
omin
-0.64
POSITIVE LOGITS
grandmother
1.02
grandma
0.96
aunt
0.88
mother
0.87
stones
0.79
åŃIJ
0.78
Carolyn
0.78
sburgh
0.77
father
0.76
arten
0.73
Activations Density 0.005%