INDEX
Explanations
narratives involving parent-child relationships, particularly focusing on fathers and their children
New Auto-Interp
Negative Logits
ioxide
-0.88
otos
-0.78
IUM
-0.78
BTC
-0.77
nit
-0.76
Quant
-0.71
Flavoring
-0.69
ilib
-0.68
risome
-0.68
Ĥª
-0.67
POSITIVE LOGITS
grandparents
1.32
grandmother
1.24
daughters
1.15
daughter
1.09
grandfather
1.09
sisters
1.03
sons
1.03
husbands
1.03
siblings
1.02
daughter
1.01
Activations Density 0.082%