INDEX
Explanations
content related to violence and content ratings
Content ratings and offensive language
R rated or profanity
Negative Logits
WithIOException
-0.72
]='\
-0.71
featureID
-0.65
DockStyle
-0.63
SOUNDBITE
-0.62
ilusión
-0.59
__':
-0.57
createSprite
-0.56
__':
-0.56
==""){
-0.54
Positive Logits
vulgar
0.96
swearing
0.93
explicit
0.91
obscene
0.91
swear
0.90
NSFW
0.89
nudity
0.86
prof
0.85
obsc
0.84
swears
0.84
Activations Density 0.208%