INDEX
    Explanations

    content related to violence and content ratings

    Content ratings and offensive language

    New Auto-Interp
    Negative Logits
    WithIOException
    -0.72
    ]='\
    -0.71
    featureID
    -0.65
    DockStyle
    -0.63
    SOUNDBITE
    -0.62
     ilusión
    -0.59
    __':
    
    -0.57
     createSprite
    -0.56
    __':
    -0.56
    ==""){
    -0.54
    POSITIVE LOGITS
     vulgar
    0.96
     swearing
    0.93
     explicit
    0.91
     obscene
    0.91
     swear
    0.90
     NSFW
    0.89
     nudity
    0.86
     prof
    0.85
     obsc
    0.84
     swears
    0.84
    Act Density 0.208%

    No Known Activations