INDEX
    Explanations

    words related to criminal activities, specifically theft

    terminology related to theft

    New Auto-Interp
    Negative Logits
     Reynolds
    -0.76
     Zucker
    -0.71
    PRES
    -0.65
     evaluates
    -0.65
     tuned
    -0.65
    enegger
    -0.65
    hel
    -0.63
    etus
    -0.63
    ghan
    -0.63
    ben
    -0.63
    POSITIVE LOGITS
     theft
    3.73
     thefts
    2.91
     Theft
    2.60
     thieves
    2.47
     thief
    2.26
     robbery
    2.07
     stealing
    2.02
     burglary
    1.93
     vandalism
    1.86
     stolen
    1.85
    Act Density 0.018%

    No Known Activations