    Negative Logits
    " the"        -0.86
    " of"         -0.76
    " back"       -0.74
    " that"       -0.74
    " at"         -0.73
    " all"        -0.73
    " them"       -0.72
    " early"      -0.71
    " then"       -0.71
    " another"    -0.70
    Positive Logits
    "الحياه"              0.82
    " disambiguazione"    0.78
    "RenderAtEndOf"       0.78
    "ThroughAttribute"    0.75
    " doInBackground"     0.74
    " ffilm"              0.73
    "rungsseite"          0.73
    " مشين"               0.70
    "Vidite"              0.69
    "المكان"              0.65
    Activation Density: 0.582%

    No Known Activations