INDEX
    Explanations

    references to a boy or boys

    New Auto-Interp
    Negative Logits
     mme
    -0.76
     tse
    -0.76
     tanga
    -0.74
     sii
    -0.70
     rong
    -0.69
     michel
    -0.67
     fta
    -0.67
     hek
    -0.66
     kac
    -0.65
     kien
    -0.64
    POSITIVE LOGITS
     boy
    1.39
     boys
    1.30
     Boy
    1.28
    boy
    1.27
    boys
    1.24
    Boy
    1.21
    Boys
    1.15
     BOY
    1.13
     Boys
    1.11
     BOYS
    1.03
    Act Density 0.054%

    No Known Activations