INDEX
    Explanations

    mentions of crimes like murder

    instances of the word "murder."

    Negative Logits
    Token                        Logit
    "Cola"                       -0.80
    "wcsstore"                   -0.76
    "ais"                        -0.73
    "UTC"                        -0.73
    "BuyableInstoreAndOnline"    -0.70
    "Dub"                        -0.68
    "imus"                       -0.67
    "uve"                        -0.67
    "ube"                        -0.66
    "arity"                      -0.65
    Positive Logits
    Token           Logit
    " spree"        1.15
    " murder"       1.03
    " murders"      0.93
    "ously"         0.90
    " homicide"     0.89
    "hyde"          0.87
    " rampage"      0.86
    " murderer"     0.86
    " murdering"    0.85
    " Murder"       0.84
    Activation Density: 0.031%

    No Known Activations