INDEX
Explanations
mentions of family members, particularly sons
references to "son" in various contexts
New Auto-Interp
Negative Logits
veyard
-0.82
iculty
-0.78
kefeller
-0.73
proport
-0.63
orney
-0.63
pmwiki
-0.62
manifold
-0.61
topic
-0.61
PORT
-0.61
Population
-0.61
POSITIVE LOGITS
hood
1.00
Gohan
0.98
hesis
0.89
nets
0.86
heses
0.83
pins
0.82
ogram
0.82
son
0.80
friend
0.78
Barron
0.77
Activations Density 0.019%