INDEX
    Explanations

    the word "don't."

    negative contractions of "do not"

    Negative Logits
    "soType"       -0.70
    "itiz"         -0.69
    "Reviewer"     -0.67
    " Reloaded"    -0.67
    " Penguin"     -0.64
    "estern"       -0.64
    "çĦ"           -0.64
    "edIn"         -0.63
    " Gry"         -0.59
    "pter"         -0.59
    Positive Logits
    " necessarily"   1.16
    " bother"        1.04
    " know"          0.86
    " anymore"       0.86
    " seem"          0.86
    "necess"         0.86
    " intend"        0.85
    "urtles"         0.85
    " appreciate"    0.85
    " expect"        0.84
    Activation Density 0.104%

    No Known Activations