INDEX
    Explanations

    past tense verbs ending in 'ed'

    instances of the word "hadn't."

    New Auto-Interp
    Negative Logits
     resulting
    -0.55
     Relative
    -0.54
    em
    -0.53
     Ext
    -0.53
     free
    -0.53
     CFR
    -0.52
     Example
    -0.52
    ielding
    -0.51
    ember
    -0.51
     Values
    -0.50
    POSITIVE LOGITS
     hadn
    3.26
     weren
    1.91
     didn
    1.81
     wasn
    1.80
     hasn
    1.78
    didn
    1.74
     haven
    1.69
     couldn
    1.58
     wouldn
    1.54
     didnt
    1.38
    Act Density 0.011%

    No Known Activations