INDEX
    Explanations

    someone's son or daughter

    New Auto-Interp
    Negative Logits
     महिलाएं
    0.72
     adults
    0.71
     professores
    0.68
     osób
    0.65
     Adults
    0.65
     خواتین
    0.64
     kobiet
    0.62
    前辈
    0.62
     والدین
    0.61
     civilians
    0.61
    POSITIVE LOGITS
     son
    4.97
     daughter
    4.42
     sons
    4.09
     Son
    4.02
    Son
    3.99
    儿子
    3.78
     Daughter
    3.62
     daughters
    3.62
    son
    3.60
    兒子
    3.59
    Act Density 0.098%

    No Known Activations