INDEX
    Explanations

    references to familial relationships, particularly focusing on sons and daughters

    New Auto-Interp
    Negative Logits
     InputDecoration
    -0.81
    WriteBarrier
    -0.78
    Vega
    -0.74
    ETHING
    -0.73
     nol
    -0.72
     }}"></
    -0.72
     nulle
    -0.71
     PMC
    -0.71
    IsContent
    -0.71
    </thead>
    -0.70
    POSITIVE LOGITS
     son
    1.83
     SON
    1.81
     Son
    1.81
     sons
    1.79
    Son
    1.64
     Sons
    1.61
    Sons
    1.53
    son
    1.50
     SONS
    1.48
     Daughter
    1.36
    Act Density 0.057%

    No Known Activations