INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     становника
    -0.82
    Autoritní
    -0.81
     الرياضيه
    -0.79
    fjspx
    -0.77
     Meksiku
    -0.75
     Мексичка
    -0.75
     disambiguazione
    -0.74
     Paglinawan
    -0.73
     cherchés
    -0.72
    ientôt
    -0.72
    POSITIVE LOGITS
     set
    0.54
     few
    0.50
     lot
    0.47
     wide
    0.47
     limited
    0.45
    set
    0.43
    alignSelf
    0.43
    ques
    0.42
     firm
    0.42
    wide
    0.40
    Act Density 0.041%

    No Known Activations