INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Waray
    -0.51
     shales
    -0.47
     Administrativna
    -0.45
    Попис
    -0.44
    HasOne
    -0.44
    asma
    -0.44
     warts
    -0.44
    ydd
    -0.43
     Nuevas
    -0.43
     urme
    -0.42
    POSITIVE LOGITS
    findpost
    0.70
    jspx
    0.67
     consultato
    0.62
    lihatkan
    0.61
     réguli
    0.60
     hidupnya
    0.59
    SharedDtor
    0.59
    cyklopedia
    0.57
    áctenos
    0.57
     Вікіпе
    0.57
    Act Density 0.004%

    No Known Activations