INDEX
    Explanations

    mentions of illegal activities or situations

    terms associated with illegal activities and crimes

    New Auto-Interp
    Negative Logits
    rike
    -0.82
    atche
    -0.80
    haps
    -0.79
    igating
    -0.76
    phasis
    -0.75
    vet
    -0.73
    lov
    -0.72
    attr
    -0.71
    eting
    -0.71
    oir
    -0.71
    POSITIVE LOGITS
     illegal
    1.22
     illegally
    1.02
     trafficking
    0.95
     Illegal
    0.95
     unauthorized
    0.85
     illeg
    0.83
    illegal
    0.83
     aliens
    0.82
     unlawful
    0.81
     immigrant
    0.80
    Act Density 0.015%

    No Known Activations