INDEX
Explanations
instances where names ending with "son" are mentioned
names or terms associated with familial relationships
Negative Logits
urgently      -0.68
folds         -0.67
Fas           -0.65
Libre         -0.65
restraining   -0.65
Debbie        -0.63
toilet        -0.62
deb           -0.62
retreat       -0.59
tab           -0.59
Positive Logits
son           4.62
SON           2.22
sen           1.94
daughter      1.86
sson          1.82
sonian        1.76
father        1.41
Son           1.33
pson          1.29
kinson        1.18
Activations Density 0.016%
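
To give a sense of scale for the density figure, here is a minimal sketch of how such a number could be computed. It assumes density means the fraction of tokens on which the feature's activation is above zero; the source does not define the metric, so that definition, the function name activation_density, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all. The source page does not spell out its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 tokens, 16 of which activate the feature.
sample = [0.0] * 99_984 + [1.5] * 16
density = activation_density(sample)
print(f"{density:.3%}")  # 0.016%
```

Under that assumption, 0.016% corresponds to roughly 16 activating tokens per 100,000, which would make this a sparse feature.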