INDEX
    Explanations

    contractions where 'not' is followed by a word

    the word "shouldn't" and its variations, indicating expressions of advisement or critique

    New Auto-Interp
    Negative Logits
     enthusi
    -0.80
     Powered
    -0.70
     locating
    -0.69
     withd
    -0.68
     encount
    -0.68
     gobl
    -0.66
     paran
    -0.63
     Herz
    -0.63
     fulfilled
    -0.62
     bombed
    -0.62
    POSITIVE LOGITS
    't
    1.58
    ned
    1.00
    n
    1.00
    ny
    0.99
    ighed
    0.90
    nt
    0.88
    ÃŃ
    0.86
    no
    0.83
    ouch
    0.82
    ning
    0.81
    Act Density 0.017%

    No Known Activations