INDEX
    Explanations

    words relating to violence or harm, particularly the act of killing

    references to the act of killing

    New Auto-Interp
    Negative Logits
    BuyableInstoreAndOnline
    -0.85
    Cola
    -0.78
    Scot
    -0.75
    ĸļ
    -0.71
    itle
    -0.71
    fty
    -0.70
    DragonMagazine
    -0.69
     Depot
    -0.69
    soType
    -0.67
    rypt
    -0.66
    POSITIVE LOGITS
     spree
    1.00
    mails
    0.85
    switch
    0.84
     civilians
    0.83
    joy
    0.83
     innocent
    0.76
     off
    0.73
    mong
    0.73
     innoc
    0.70
     rampage
    0.70
    Act Density 0.047%

    No Known Activations