INDEX
    Explanations

    phrases related to physical conflict or altercations

    actions and events related to confrontation or violence

    New Auto-Interp
    Negative Logits
     Inher
    -0.71
    etheless
    -0.65
    efeated
    -0.65
     inher
    -0.63
     bestselling
    -0.63
     quant
    -0.61
     dearly
    -0.61
     prag
    -0.61
     archives
    -0.61
    obyl
    -0.60
    POSITIVE LOGITS
     altercation
    0.67
     startled
    0.67
     disturbance
    0.66
     gunfire
    0.66
     grabbed
    0.65
     agitated
    0.63
    Stop
    0.62
     takedown
    0.62
     overpowered
    0.61
     yelling
    0.61
    Act Density 0.872%

    No Known Activations