INDEX
    Explanations

    words and phrases indicating aggressive behavior

    New Auto-Interp
    Negative Logits
    VersionUID
    -0.39
    -0.39
     '/',
    -0.38
    (!__
    -0.37
    chränk
    -0.37
    כנ
    -0.36
    theguardian
    -0.35
    "/",
    -0.35
     dú
    -0.35
    basicConfig
    -0.35
    POSITIVE LOGITS
    aggressive
    0.79
     aggressiveness
    0.73
     Aggressive
    0.71
     aggressive
    0.70
    Aggressive
    0.69
     aggressively
    0.69
     hug
    0.68
    adopt
    0.64
     adopt
    0.61
    printStackTrace
    0.61
    Act Density 0.200%

    No Known Activations