INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     cặp
    -0.07
    -0.07
    ships
    -0.06
    !↵↵↵↵↵↵
    -0.06
     spontaneously
    -0.06
    .ComponentResourceManager
    -0.06
    מסוגל
    -0.06
    -0.06
    。↵↵↵↵
    -0.06
    招商引
    -0.06
    POSITIVE LOGITS
    0.08
    𝅪
    0.07
    úb
    0.07
     sucess
    0.07
    Enumer
    0.07
     OTHERWISE
    0.07
    进程
    0.07
    ̀
    0.07
     launching
    0.06
    0.06
    Act Density 0.005%

    No Known Activations