INDEX
    Explanations

    contractions of "were not" or "had not"

    negative contractions related to existence or presence

    NEGATIVE LOGITS
    Token            Logit
    " [+]"           -0.73
    " DRAGON"        -0.72
    "ONS"            -0.68
    " Mechdragon"    -0.67
    "upp"            -0.67
    "lees"           -0.63
    "comings"        -0.63
    " PU"            -0.63
    "ramid"          -0.62
    " Dise"          -0.61
    POSITIVE LOGITS
    Token            Logit
    "'t"             1.07
    "itial"          0.90
    "iting"          0.73
    "gery"           0.71
    "announced"      0.70
    "ited"           0.69
    " tink"          0.69
    "apolog"         0.67
    "ajor"           0.67
    "ÃŃ"             0.67
    Activation Density: 0.049%

    No Known Activations
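
    A minimal sketch of how a density figure like the one above could be computed, assuming it means the percentage of sampled tokens on which the feature's activation is above zero. The function name, the threshold parameter, and the sample values are hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return 100.0 * fired / len(activations)


# Hypothetical sample: the feature fires on 1 of 4 tokens.
sample = [0.0, 0.0, 0.5, 0.0]
print(f"{activation_density(sample):.3f}%")
```

    Under this reading, a density of 0.049% would correspond to the feature firing on roughly 1 token in 2,000.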