INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    تقاوى
    -0.97
     Paglinawan
    -0.77
    ukone
    -0.68
    -0.68
     artificiales
    -0.66
    ftagPool
    -0.66
     kasarigan
    -0.66
    djangoproject
    -0.64
    +#+
    -0.64
     Gifford
    -0.63
    POSITIVE LOGITS
    rop
    0.46
     tass
    0.46
     öns
    0.44
     initComponents
    0.42
    rav
    0.41
    Ms
    0.41
    сима
    0.41
    ńcu
    0.41
    ato
    0.40
    RequestOptions
    0.39
    Act Density 0.012%

    No Known Activations