INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     নি�
    -0.08
     జీవ
    -0.08
     Etter
    -0.08
    Interpol
    -0.07
    .margin
    -0.07
     మూ�
    -0.07
     ప్ల
    -0.07
     Após
    -0.07
    .sa
    -0.07
     Lebens
    -0.07
    POSITIVE LOGITS
    presentation
    0.09
    ummet
    0.09
    certificate
    0.08
     opties
    0.08
    ienst
    0.08
     aanged
    0.08
     pointers
    0.08
     respectivamente
    0.08
     برابر
    0.08
    aporan
    0.08
    Act Density 0.006%

    No Known Activations