INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    a
    1.42
    на
    1.25
    s
    1.23
    ات
    1.23
    ne
    1.19
    .
    1.18
    de
    1.09
    q
    1.08
    1.06
    o
    1.04
    POSITIVE LOGITS
     
    1.43
    <h4>
    1.28
     izdel
    1.16
    ل
    1.16
     It
    1.14
     proizvod
    1.14
     ﺍﻟ
    1.13
     uczni
    1.12
     powied
    1.09
    <h2>
    1.09
    Act Density 0.000%

    No Known Activations