INDEX
    Explanations

    incidents involving injury or harm to individuals, particularly in relation to violent acts

    Negative Logits
    "ãĥŃãĥ¼"                   -0.17
    "assi"                     -0.15
    "abies"                    -0.14
    "us"                       -0.14
    "ONEY"                     -0.14
    "ãĤ¹ãĥĿ"                   -0.13
    " permalink"               -0.13
    " kå"                      -0.13
    "å¨"                       -0.13
    " FileNotFoundException"   -0.13

    Positive Logits
    " woman"                    0.39
    " man"                      0.38
    " couple"                   0.31
    " mother"                   0.27
    " Woman"                    0.25
    " girl"                     0.25
    " father"                   0.25
    " teenager"                 0.24
    " teen"                     0.23
    " boy"                      0.23

    Activation Density: 0.283%

    No Known Activations