INDEX
    Explanations

    expressions indicating an increase or enhancement

    NEGATIVE LOGITS
    hani            -0.18
    asers           -0.17
     Nug            -0.16
    すぎ            -0.16
    AINED           -0.16
    acha            -0.15
    asia            -0.14
    RectTransform   -0.14
    amage           -0.14
     localVar       -0.14
    POSITIVE LOGITS
     than           0.18
     reverse        0.17
    Than            0.16
    irth            0.16
    eger            0.15
    MORE            0.14
     underlying     0.14
     Works          0.14
     worse          0.14
    more            0.14
    Act Density 0.054%

    No Known Activations