INDEX
    Explanations

    descriptions of illegal or unethical activities

    instances of scams and criminal activities involving manipulation or deception

    New Auto-Interp
    Negative Logits
    ĪĴ
    -0.78
    ãĥ´
    -0.71
     reflection
    -0.70
     limitation
    -0.68
     Reflect
    -0.68
     Debate
    -0.68
    equal
    -0.67
    olon
    -0.67
    eatures
    -0.67
    utral
    -0.67
    POSITIVE LOGITS
     prostitutes
    1.29
     prostitute
    1.22
     pornographic
    1.21
     extortion
    1.20
     blackmail
    1.18
     smugg
    1.16
     prostitution
    1.15
     ransom
    1.14
     traffickers
    1.13
     bribes
    1.12
    Act Density 0.975%

    No Known Activations