INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ちょっと
    -0.07
    -0.07
     diagon
    -0.07
    "Not
    -0.07
     publi
    -0.07
     falls
    -0.06
     PAL
    -0.06
     sea
    -0.06
     luận
    -0.06
     اختصاص
    -0.06
    POSITIVE LOGITS
    aab
    0.06
    _integration
    0.06
    iyesi
    0.06
    var
    0.06
     ilişk
    0.06
    avn
    0.06
    умент
    0.06
    0.06
     DbContext
    0.06
    وسف
    0.05
    Act Density 0.017%

    No Known Activations