    Explanations

    references to boys and male children

    Negative Logits
     InputDecoration    -0.79
    >")                 -0.74
    Composable          -0.73
    CreateMap           -0.71
     PMC                -0.71
     للمعارف            -0.70
    ″]                  -0.70
    IsContent           -0.69
     "").               -0.68
    ‬‬                  -0.68
    Positive Logits
     boys               1.72
     Boys               1.72
     BOYS               1.66
    Boy                 1.61
     BOY                1.60
    Boys                1.59
     boy                1.59
     Boy                1.56
    boy                 1.55
    boys                1.53
    Activation Density: 0.048%

    No Known Activations