INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     IDENT
    -0.07
    -use
    -0.06
    cular
    -0.06
    .UserName
    -0.06
     predictive
    -0.06
     بعدی
    -0.06
     bach
    -0.06
    -0.06
     účast
    -0.06
    ранения
    -0.06
    POSITIVE LOGITS
     Normalize
    0.06
     MASS
    0.06
     kullanıcı
    0.06
     BEEN
    0.06
     Jimmy
    0.06
     Auf
    0.06
     LUA
    0.06
     logfile
    0.06
    $msg
    0.06
     gerektiğini
    0.06
    Act Density 0.006%

    No Known Activations