INDEX
    Explanations
    Negative Logits
    " ویکی‌پدیا"    -0.40
    " anuncios"    -0.39
    "ucap"    -0.38
    " Soyez"    -0.35
    " át"    -0.33
    "DialogComponent"    -0.32
    " utilice"    -0.31
    "hafen"    -0.31
    " davon"    -0.31
    "rvore"    -0.31
    Positive Logits
    " year"    1.06
    " week"    0.82
    " month"    0.74
    " weekend"    0.71
    " summer"    0.67
    "دانشنامهٔ"    0.66
    " night"    0.66
    " ListTile"    0.64
    "year"    0.63
    " année"    0.63
    Act Density 0.012%

    No Known Activations