INDEX
    Explanations

    proportions

    New Auto-Interp
    Negative Logits
     fumar
    -0.11
    ্য
    -0.08
     predefined
    -0.08
     ladan
    -0.07
     postal
    -0.07
    -0.07
    Feed
    -0.07
    -0.07
     był
    -0.07
     surrounding
    -0.07
    POSITIVE LOGITS
     Scaling
    0.12
     proporcional
    0.11
     scaling
    0.11
     Crimes
    0.10
     extrap
    0.10
    Scaling
    0.10
     scaled
    0.09
     ари
    0.09
     proportional
    0.09
     Faith
    0.09
    Act Density 0.081%

    No Known Activations