INDEX
Explanations
mentions of family relationships, specifically referring to sons
references to "son" or familial relationships involving sons
Negative Logits
veyard       -0.74
Union        -0.73
Population   -0.65
Women        -0.65
hazards      -0.65
manifold     -0.61
conglomer    -0.58
Client       -0.57
freight      -0.57
hazard       -0.55
Positive Logits
ogram    1.00
nets     0.98
hood     0.97
Barron   0.92
Gohan    0.86
ograms   0.85
orous    0.85
zai      0.84
athan    0.82
son      0.82
Activations Density 0.036%