INDEX
    Explanations

    profane terms or strong language

    explicit language and strong profanity

    New Auto-Interp
    Negative Logits
     elig
    -0.73
     conduc
    -0.69
    Buyable
    -0.67
    utterstock
    -0.63
     ancest
    -0.63
     Nav
    -0.63
     mosqu
    -0.62
    è£ħ
    -0.62
     isolation
    -0.61
     accomp
    -0.61
    POSITIVE LOGITS
    tty
    1.17
    cking
    1.13
    gger
    1.08
    shit
    1.08
    kers
    1.05
    king
    1.05
    holes
    1.03
    tch
    1.02
    hole
    1.00
    cks
    0.99
    Act Density 0.108%

    No Known Activations