INDEX
    Explanations

    words related to violence

    references to violence or violent acts

    New Auto-Interp
    Negative Logits
     Premium
    -0.81
    Pod
    -0.77
    DK
    -0.74
    rin
    -0.72
     Boost
    -0.72
     TTL
    -0.71
     Sparkle
    -0.71
    elle
    -0.71
     Fleet
    -0.69
     Labs
    -0.69
    POSITIVE LOGITS
     violent
    3.37
    violent
    2.65
     Violent
    2.33
     violence
    2.08
     nonviolent
    2.03
    violence
    1.91
     violently
    1.79
    Viol
    1.76
     murderous
    1.71
     Violence
    1.65
    Act Density 0.020%

    No Known Activations