INDEX
Explanations
mentions of family members, specifically sons
mentions of the word "son."
New Auto-Interp
Negative Logits
veyard
-0.79
hazards
-0.65
Union
-0.64
manifold
-0.61
iculty
-0.60
Population
-0.60
freight
-0.60
PORT
-0.59
accommodations
-0.58
mechanisms
-0.57
POSITIVE LOGITS
ogram
1.03
nets
1.00
hood
0.95
Barron
0.89
Gohan
0.87
pins
0.85
hesis
0.84
ograms
0.83
son
0.81
zai
0.81
Activations Density 0.032%