INDEX
    Explanations

    murder, death, crime

    New Auto-Interp
    Negative Logits
    Killing
    -0.82
     killing
    -0.82
     murdered
    -0.77
     kill
    -0.75
    kill
    -0.73
     KILL
    -0.73
     murders
    -0.71
     Killing
    -0.71
     murder
    -0.70
     killed
    -0.70
    POSITIVE LOGITS
     bahay
    0.61
     rung
    0.55
    ształ
    0.51
     refuge
    0.50
     scouting
    0.49
     crowning
    0.49
     resolutions
    0.49
     loob
    0.49
     brim
    0.47
     Mep
    0.47
    Act Density 0.028%

    No Known Activations