INDEX
    Explanations

    terms associated with violence and aggressive behavior

    New Auto-Interp
    Negative Logits
    NOPQRST
    -1.05
     مرئيه
    -0.75
    cumulative
    -0.75
    ContentAlignment
    -0.74
    awtextra
    -0.71
     Silber
    -0.70
    tling
    -0.69
     beträgt
    -0.68
     Pante
    -0.67
     vnto
    -0.66
    POSITIVE LOGITS
     Paglinawan
    1.01
     elegans
    0.86
     dress
    0.77
     isInitialized
    0.75
    pyplot
    0.75
     Hooker
    0.74
     Wray
    0.73
     Tension
    0.72
     DbSet
    0.72
     Hooks
    0.72
    Act Density 0.079%

    No Known Activations