    Explanations

    references to girls and women

    mentions of "Girls" within various contexts

    Negative Logits
    " convincing"   -0.77
    "utherford"     -0.75
    "SPONSORED"     -0.73
    " loud"         -0.71
    " Blumenthal"   -0.71
    "ype"           -0.69
    " Kaine"        -0.67
    " shaking"      -0.64
    " ital"         -0.64
    "ypes"          -0.63
    Positive Logits
    " Girls"        1.45
    "Girls"         1.35
    " Actress"      0.91
    " Girl"         0.90
    " Boys"         0.90
    " Haram"        0.87
    "Apps"          0.87
    "poons"         0.86
    "glers"         0.85
    " Fighters"     0.85
    Activation Density: 0.017%

    No Known Activations