INDEX
    Explanations

    mentions of adult-related content or activities

    New Auto-Interp
    Negative Logits
    adr
    -0.85
    atility
    -0.79
    ãĤ¡
    -0.77
    gments
    -0.71
    anca
    -0.70
    sts
    -0.69
    externalActionCode
    -0.69
    wark
    -0.69
    SOURCE
    -0.68
    gment
    -0.68
    POSITIVE LOGITS
    erer
    1.09
    erers
    1.07
     Swim
    0.99
     males
    0.87
     supervision
    0.86
     male
    0.86
    hetical
    0.84
     beverages
    0.82
     diapers
    0.81
    adult
    0.80
    Act Density 0.059%

    No Known Activations