INDEX
    Explanations

    Events involving violence or assaults against individuals, with a focus on details such as the method and circumstances of the attacks.

    Negative Logits
    .cleanup
    -0.15
    imo
    -0.15
     eiusmod
    -0.15
     tác
    -0.14
     zase
    -0.14
     же
    -0.14
    ourg
    -0.13
     artık
    -0.13
     таких
    -0.13
     kå
    -0.13
    Positive Logits
     while
    0.38
     whilst
    0.31
     moments
    0.31
    while
    0.31
     minutes
    0.28
     WHILE
    0.27
     seconds
    0.25
    _while
    0.25
     While
    0.24
     after
    0.24
    Act Density 0.285%

    No Known Activations