INDEX
    Explanations

    specific lexical items or characters

    Negative Logits
    Token             Logit
    " aéri"           -0.52
    " Aner"           -0.52
    "ktır"            -0.51
    " casamento"      -0.47
    " ligiloj"        -0.47
    " parallèle"      -0.46
    " kaybet"         -0.45
    " religieuses"    -0.45
    " Insee"          -0.44
    "uyor"            -0.44
    Positive Logits
    Token                 Logit
    "Дереккөздер"         0.75
    "नलिखित"              0.74
    "WriteBarrier"        0.73
                          0.72
    "awtextra"            0.71
    " IActionResult"      0.71
    " ModelExpression"    0.71
    "LookAnd"             0.70
    "</tfoot>"            0.65
    "期刊论文"             0.64
    Activation Density: 0.032%

    No Known Activations