INDEX
    Explanations

    crime and death

    New Auto-Interp
    Negative Logits
    sts
    -0.07
     plates
    -0.07
     Indiana
    -0.07
     Churches
    -0.06
    plib
    -0.06
    ,F
    -0.06
     (`
    -0.06
     Neutral
    -0.06
    ,f
    -0.06
    .week
    -0.06
    POSITIVE LOGITS
    0.06
     +
    0.06
     estamos
    0.06
     Del
    0.06
     треть
    0.06
     Eğitim
    0.06
    drž
    0.06
     Sne
    0.06
    не
    0.06
    0.06
    Act Density 0.010%

    No Known Activations