    Negative Logits
    Token             Logit
    " wink"           -1.17
    "wink"            -1.02
    " Wink"           -0.84
    " winked"         -0.62
    "zweig"           -0.57
    " mitu"           -0.49
    "padu"            -0.47
    "épaisseur"       -0.47
    " EconPapers"     -0.47
    "ceder"           -0.45
    Positive Logits
    Token                 Logit
    " Wikimedijinoj"      0.74
    "ViewFeatures"        0.67
    "imize"               0.65
    "الدراسه"             0.65
    "AntiForgeryToken"    0.63
    "webElementXpaths"    0.62
    " دیکھیے"             0.62
    "CodeAttribute"       0.61
    "DeleteBehavior"      0.61
    "EIP"                 0.59
    Activation Density: 0.185%

    No Known Activations