INDEX
Explanations
terms related to the relationship between fathers and sons
references to the word "father" in various contexts and discussions
New Auto-Interp
Negative Logits
psey
-0.82
ellen
-0.74
atility
-0.73
mble
-0.71
Flavoring
-0.69
AW
-0.65
CHQ
-0.64
FW
-0.64
ocobo
-0.64
graded
-0.63
POSITIVE LOGITS
hood
1.16
father
1.03
patriarch
0.96
hesis
0.94
parents
0.93
father
0.89
Father
0.86
dad
0.85
hetical
0.83
son
0.81
Activations Density 0.016%