INDEX
    Explanations

    terms related to bullying or being bullied

    terms related to bullying and its effects

    New Auto-Interp
    Negative Logits
     Donation
    -0.77
    aeda
    -0.76
    ittance
    -0.74
    arbon
    -0.72
    ossier
    -0.72
    vid
    -0.70
    ixed
    -0.70
    cession
    -0.70
    ixture
    -0.69
    ournal
    -0.69
    POSITIVE LOGITS
     bullies
    1.24
     bullying
    1.19
     bully
    1.08
     bullied
    1.08
     behav
    0.85
    ãħĭ
    0.82
     pul
    0.80
    kids
    0.77
     slurs
    0.71
    HAEL
    0.69
    Act Density 0.012%

    No Known Activations