INDEX
    Explanations

    references to human trafficking and exploitation

    New Auto-Interp
    Negative Logits
    ully
    -0.16
    orges
    -0.16
    rief
    -0.16
    inox
    -0.15
    archy
    -0.15
     Fleet
    -0.15
    olt
    -0.15
    stab
    -0.14
     toJSON
    -0.14
    ض
    -0.14
    POSITIVE LOGITS
     trafficking
    0.26
     traff
    0.25
     Traff
    0.23
     slavery
    0.19
     bonded
    0.19
     bondage
    0.19
     human
    0.19
     sex
    0.18
     child
    0.18
     traf
    0.18
    Act Density 0.020%

    No Known Activations