INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     müm
    -0.09
    abilir
    -0.09
     ممکن
    -0.09
    ahle
    -0.09
     мәз
    -0.09
    -Unis
    -0.09
     mümkin
    -0.09
     povas
    -0.08
    -slider
    -0.08
     installiert
    -0.08
    POSITIVE LOGITS
     nephew
    0.08
    "H
    0.08
     intitul
    0.08
    0.07
     Zac
    0.07
     nurse
    0.07
     драм
    0.07
     androgen
    0.07
     boobs
    0.07
     oncology
    0.07
    Act Density 0.004%

    No Known Activations