INDEX
    Explanations

    references to attacks and aggressive actions

    "attack" or "attacks"

    New Auto-Interp
    Negative Logits
    liğini
    -0.46
     zelve
    -0.46
     Vernunft
    -0.46
     Picchu
    -0.45
     acreditar
    -0.45
     huesos
    -0.44
    HasBeenSet
    -0.43
    rungsseite
    -0.42
    -0.42
     désolés
    -0.42
    POSITIVE LOGITS
     attack
    2.36
    attack
    2.14
     Attack
    2.08
     attacks
    1.98
    Attack
    1.93
     ATTACK
    1.92
     attacked
    1.84
     attacking
    1.82
     Attacks
    1.73
    attacks
    1.73
    Act Density 0.129%

    No Known Activations