INDEX
    Explanations
    Negative Logits
    Token             Logit
    "_ms"             -0.08
    " assurances"     -0.07
    " diction"        -0.07
    "Billing"         -0.07
    "zuk"             -0.07
    " payroll"        -0.07
    "enje"            -0.07
    "Weekly"          -0.07
    "ssis"            -0.07
    "שרות"            -0.07
    Positive Logits
    Token             Logit
    "jenige"          0.08
    "ijden"           0.08
    " lòt"            0.08
    (token missing)   0.08
    ",G"              0.08
    (token missing)   0.07
    "ичество"         0.07
    "�ి"              0.07
    "имое"            0.07
    " Wanted"         0.07
    Activation Density: 0.004%

    No Known Activations