INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Демографія
    -0.41
     saites
    -0.41
    <?
    -0.40
    tvguidetime
    -0.40
     autorytatywna
    -0.38
    PRNewswire
    -0.38
     diikuti
    -0.38
    yorsunuz
    -0.38
     >=",
    -0.38
    deleteById
    -0.37
    POSITIVE LOGITS
     part
    0.69
     caufe
    0.66
    nitus
    0.63
     createState
    0.62
     nucleus
    0.61
    interopRequire
    0.60
    kmäler
    0.60
     Keny
    0.60
     preſent
    0.60
     Jefus
    0.60
    Act Density 0.020%

    No Known Activations