INDEX
    NEGATIVE LOGITS
    " ros"                  -0.07
    " PROP"                 -0.06
    "YC"                    -0.06
    ".hwp"                  -0.06
    " slun"                 -0.06
    " Sox"                  -0.06
    " капит"                -0.06
    " StatusCode"           -0.06
    " Skate"                -0.06
    (token not rendered)    -0.06
    POSITIVE LOGITS
    " Mixing"                0.07
    " Without"               0.07
    " ~"                     0.07
    ".factor"                0.07
    " without"               0.06
    (token not rendered)     0.06
    "education"              0.06
    ")*"                     0.06
    "odia"                   0.06
    " [."                    0.06
    Activation Density: 0.007%

    No Known Activations