    Explanations
    Negative Logits
     grupo        -0.09
     amplio       -0.08
    Aquí          -0.07
     ચૂ           -0.07
     India        -0.07
     Mete         -0.07
     interessado  -0.07
     पीछ          -0.07
     interesado   -0.07
    -for          -0.07
    Positive Logits
    ылар          0.08
    .Crud         0.08
     करी          0.08
    (IService     0.08
     ninguém      0.08
     robin        0.08
    \e            0.07
     nexus        0.07
     Kos          0.07
    кра           0.07
    Activation Density: 0.003%

    No Known Activations