INDEX
Explanations
words related to inappropriate behavior or language
terms related to indecency and obscenity
New Auto-Interp
Negative Logits
olin
-0.81
quickShipAvailable
-0.79
ACP
-0.77
iller
-0.75
Airl
-0.73
arij
-0.72
ochond
-0.71
VPN
-0.71
Oracle
-0.70
¯¯¯¯¯¯¯¯
-0.68
POSITIVE LOGITS
lewd
1.20
indecent
1.12
obscene
0.85
blasp
0.77
ejac
0.75
Sexual
0.74
vulgar
0.72
writ
0.72
masturb
0.71
uously
0.70
Activations Density 0.016%