INDEX
    Explanations

    negations and expressions of uncertainty or non-affirmation

    New Auto-Interp
    Negative Logits
    ãĥ©ãĥ¼
    -0.14
    zech
    -0.14
    apis
    -0.14
     sacr
    -0.14
     hang
    -0.14
    rink
    -0.13
    plies
    -0.13
     Kane
    -0.13
     Wie
    -0.13
     tel
    -0.13
    POSITIVE LOGITS
    ubbo
    0.15
    ãģ£ãģ¡
    0.15
    ystone
    0.15
    åħ·
    0.14
    AutoSize
    0.14
     Milit
    0.14
    auss
    0.14
    ÄŁu
    0.14
    ebin
    0.14
    ubat
    0.14
    Act Density 0.107%

    No Known Activations