INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    (AP
    -0.07
     Weekend
    -0.07
    imentary
    -0.07
    Fitness
    -0.07
     Hund
    -0.07
     toll
    -0.07
    -0.06
     package
    -0.06
    -0.06
    了不少
    -0.06
    POSITIVE LOGITS
     replic
    0.07
    公众
    0.07
     كب
    0.06
    قاسم
    0.06
    .Customer
    0.06
    _DIPSETTING
    0.06
     rats
    0.06
     Democracy
    0.06
    0.06
    店加盟
    0.06
    Act Density 0.004%

    No Known Activations