    Explanations

    phrases related to physical aggression and sports outcomes

    Negative Logits
    " linh"       -0.43
    " Formula"    -0.43
    "Alert"       -0.42
    " attempt"    -0.42
    "Reg"         -0.41
    "icio"        -0.41
    "CHAPTER"     -0.40
    "ista"        -0.40
    (blank)       -0.40
    "reg"         -0.40
    Positive Logits
    ":✨"                 0.99
    " annihilation"      0.89
    " arras"             0.86
    "setVerticalGroup"   0.84
    " slaugh"            0.81
    " slaughtered"       0.81
    " Савезне"           0.81
    " beating"           0.78
    " mince"             0.78
    " slaughter"         0.77
    Activation Density: 0.204%

    No Known Activations