INDEX
    Explanations

    negation phrases or statements indicating inability or failure

    New Auto-Interp
    Negative Logits
    
    -0.81
    })$}
    -0.75
    RenderAtEndOf
    -0.72
     autorytatywna
    -0.70
    Дереккөздер
    -0.70
     defaultstate
    -0.68
    ^(@)
    -0.68
     springfox
    -0.67
    awtextra
    -0.66
    脚注の使い方
    -0.65
    POSITIVE LOGITS
     Cannot
    1.03
    cannot
    0.95
     cannot
    0.94
    Cannot
    0.94
     CANNOT
    0.83
     Unable
    0.81
     impossibility
    0.79
     impossível
    0.77
     inability
    0.76
     невозможно
    0.75
    Act Density 0.568%

    No Known Activations