INDEX
Explanations
expletives and vulgar language
variations of profanity and explicit language
Negative Logits
Flavoring     -0.76
Level         -0.73
icles         -0.71
ntil          -0.68
Interstitial  -0.67
knit          -0.67
vale          -0.65
endant        -0.65
Footnote      -0.64
Serv          -0.64
Positive Logits
kidding       1.00
stupid        0.85
retarded      0.84
idiot         0.80
nuts          0.80
THING         0.79
sucks         0.78
bastard       0.77
retard        0.77
hell          0.77
Activations Density 0.063%