INDEX
Explanations
descriptions of illegal or unethical activities
instances of scams and criminal activities involving manipulation or deception
New Auto-Interp
Negative Logits
ĪĴ
-0.78
ãĥ´
-0.71
reflection
-0.70
limitation
-0.68
Reflect
-0.68
Debate
-0.68
equal
-0.67
olon
-0.67
eatures
-0.67
utral
-0.67
POSITIVE LOGITS
prostitutes
1.29
prostitute
1.22
pornographic
1.21
extortion
1.20
blackmail
1.18
smugg
1.16
prostitution
1.15
ransom
1.14
traffickers
1.13
bribes
1.12
Activations Density 0.975%