INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ورية
    -0.07
     نور
    -0.06
    -master
    -0.06
    ALAR
    -0.06
    otomy
    -0.06
     tensor
    -0.06
    768
    -0.06
    اريخ
    -0.06
    тора
    -0.06
     pred
    -0.06
    POSITIVE LOGITS
     amac
    0.07
     she
    0.06
    .LoadScene
    0.06
     удоб
    0.06
     Thinking
    0.06
    cheiden
    0.06
    SION
    0.06
     가지
    0.06
     hallmark
    0.06
    webpack
    0.06
    Act Density 0.050%

    No Known Activations