INDEX
    Explanations

    instances of the word "threat" and related terms indicating danger or risk

    New Auto-Interp
    Negative Logits
     propOrder
    -0.59
     HasFactory
    -0.54
    IntoConstraints
    -0.52
    deed
    -0.52
     Pingback
    -0.51
    WebMethod
    -0.50
    invokeLater
    -0.47
    LikeLike
    -0.47
    retweet
    -0.46
     californ
    -0.46
    POSITIVE LOGITS
     threat
    0.77
     attacks
    0.75
     threats
    0.74
     attack
    0.71
     approval
    0.71
    Approval
    0.70
     approvals
    0.70
    attack
    0.65
     Approval
    0.64
    approval
    0.63
    Act Density 0.257%

    No Known Activations