INDEX
    Explanations
    Negative Logits
    Token             Logit
    " مصرف"           -0.08
    " revisions"      -0.08
    " دوري"           -0.08
    "DW"              -0.08
    " alternatief"    -0.08
    "EMENTS"          -0.08
    " دو"             -0.07
    " nostru"         -0.07
    " conjunto"       -0.07
    "ментов"          -0.07
    Positive Logits
    Token             Logit
    " चिं"             0.08
    "Interrupted"     0.08
    " कैसी"            0.08
    "_about"          0.08
    "Implemented"     0.08
    " केली"            0.08
    " females"        0.08
    " নিশ্চ"           0.07
    "Zak"             0.07
    "_defined"        0.07
    Activation Density: 0.004%

    No Known Activations