INDEX
    Explanations

    words related to inappropriate behavior or language

    terms related to indecency and obscenity

    New Auto-Interp
    Negative Logits
    olin
    -0.81
    quickShipAvailable
    -0.79
    ACP
    -0.77
    iller
    -0.75
     Airl
    -0.73
    arij
    -0.72
    ochond
    -0.71
    VPN
    -0.71
    Oracle
    -0.70
    ¯¯¯¯¯¯¯¯
    -0.68
    POSITIVE LOGITS
     lewd
    1.20
     indecent
    1.12
     obscene
    0.85
     blasp
    0.77
     ejac
    0.75
    Sexual
    0.74
     vulgar
    0.72
     writ
    0.72
     masturb
    0.71
    uously
    0.70
    Act Density 0.016%

    No Known Activations