INDEX
Explanations
vulgar language
intense and frequent use of profanity, particularly the term "fucking."
Negative Logits
BIL       -0.83
anu       -0.81
Msg       -0.81
laus      -0.81
endant    -0.80
Folder    -0.75
opian     -0.74
ct        -0.73
ère       -0.71
mere      -0.71
Positive Logits
kidding   0.99
idiot     0.83
prick     0.82
fucking   0.81
idiots    0.80
bastard   0.79
goddamn   0.77
asshole   0.76
hell      0.74
huge      0.74
Activations Density: 0.032%