INDEX
    Explanations

    phrases related to theft and illegal activities

    instances of theft or stealing

    New Auto-Interp
    Negative Logits
    ICAN
    -0.96
    natureconservancy
    -0.88
    FML
    -0.80
    HUD
    -0.75
    SPONSORED
    -0.73
    arella
    -0.71
    cular
    -0.70
    eele
    -0.70
    ICA
    -0.69
    QUI
    -0.68
    POSITIVE LOGITS
     contam
    0.75
     disguise
    0.72
     stolen
    0.66
     paradise
    0.66
     stash
    0.65
     carc
    0.64
     deposit
    0.64
    ishable
    0.63
     spoil
    0.63
     souven
    0.62
    Act Density 0.399%

    No Known Activations