INDEX
    Explanations

    profanity and aggressive language

    New Auto-Interp
    Negative Logits
    ISC
    -0.99
    Newsletter
    -0.94
    Interstitial
    -0.94
    Reviewer
    -0.84
    xit
    -0.82
    krit
    -0.81
    031
    -0.81
    MSN
    -0.79
    quished
    -0.75
    RANT
    -0.75
    POSITIVE LOGITS
     wanna
    0.76
     dop
    0.76
     tha
    0.75
     lifetime
    0.73
     dat
    0.71
     punk
    0.68
    pace
    0.65
     queer
    0.64
     kid
    0.64
     gonna
    0.63
    Act Density 5.376%

    No Known Activations