    Negative Logits
        Token              Logit
        " collective"      -0.08
        "ueless"           -0.07
        "વાનો"             -0.07
        "collect"          -0.07
        " cursed"          -0.07
        " કરવાનો"          -0.07
        " collectively"    -0.07
        "estima"           -0.07
        ".Department"      -0.07
        " toplam"          -0.07
    Positive Logits
        Token              Logit
        " wiederum"        0.09
        " снова"           0.09
        " שוב"             0.08
        " erneut"          0.08
        " nuevamente"      0.08
        " naman"           0.08
        " Gabriel"         0.08
        " هل"              0.08
        " desal"           0.08
        " maneh"           0.08
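One common way such negative and positive logit lists are produced is to project a feature's output direction through the model's unembedding matrix and keep the lowest- and highest-scoring tokens. The sketch below illustrates that idea only; it is an assumption about how lists like these may be derived, not a description of this page's pipeline, and `top_and_bottom_logits`, `feature_direction`, and `unembed` are hypothetical names.

```python
def top_and_bottom_logits(vocab, feature_direction, unembed, k=10):
    """Rank tokens by the logit effect of a feature direction.

    vocab:             list of token strings, length V
    feature_direction: list of d floats (hypothetical feature output direction)
    unembed:           d rows of V floats (hypothetical unembedding matrix)
    Returns (negative, positive), each a list of (token, effect) pairs.
    """
    d = len(feature_direction)
    # Logit effect per token: dot product of the direction with that token's column.
    effects = [
        sum(feature_direction[i] * unembed[i][t] for i in range(d))
        for t in range(len(vocab))
    ]
    ranked = sorted(zip(vocab, effects), key=lambda pair: pair[1])
    negative = ranked[:k]        # most negative effect first
    positive = ranked[::-1][:k]  # most positive effect first
    return negative, positive


# Toy example with a 3-token vocabulary and a 2-dimensional direction.
toy_vocab = [" again", " collective", " the"]
toy_direction = [1.0, 0.0]
toy_unembed = [[0.5, -0.2, 0.1], [9.0, 9.0, 9.0]]
toy_negative, toy_positive = top_and_bottom_logits(
    toy_vocab, toy_direction, toy_unembed, k=1
)
```

With these toy inputs, the second row of `toy_unembed` is ignored because the direction is zero along that dimension, so the ranking depends only on the first row.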
    Activation Density: 0.018%
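Activation density is usually read as the fraction of sampled tokens on which the feature fires. The sketch below assumes that definition (activation strictly above a threshold of zero); the function name, the threshold, and the sample counts are illustrative assumptions, chosen only so the result matches the 0.018% figure.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold.

    The threshold of 0.0 is an assumption; a dashboard may use a different cutoff.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: the feature fires on 18 of 100,000 tokens.
sample_activations = [0.0] * 99_982 + [1.5] * 18
density_text = f"{activation_density(sample_activations):.3%}"
print(density_text)
```

At this density the feature would be active on roughly one token in every 5,500, which is sparse even by the standards of sparse features.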

    No Known Activations