    Negative Logits
    " старт"  -0.09
    " benod"  -0.09
    " החלט"  -0.09
    " განს"  -0.08
    ".MEDIA"  -0.08
    "keurig"  -0.08
    " აუცილ"  -0.08
    " parker"  -0.08
    " অভিন"  -0.08
    " verfol"  -0.08
    Positive Logits
    "Categorie"  0.08
    0.08
    "Gujarati"  0.08
    "Portugu"  0.07
    "Phong"  0.07
    "language"  0.07
    " لغة"  0.07
    "[word"  0.07
    "Divide"  0.07
    " fréquence"  0.07
    Activation Density: 0.000%

    No Known Activations