INDEX
    Explanations

    mentions of 'the' in specific contexts

    occurrences of the word "the"

    New Auto-Interp
    Negative Logits
     accordingly
    -0.76
     furthermore
    -0.76
    merce
    -0.76
     anew
    -0.74
    Layer
    -0.71
    olicy
    -0.69
    ocument
    -0.66
    ooth
    -0.66
     nevertheless
    -0.65
    Versions
    -0.63
    POSITIVE LOGITS
     outset
    1.25
     aforementioned
    1.16
     standpoint
    1.13
     same
    1.05
     confines
    0.99
     depths
    0.98
     earliest
    0.93
     foregoing
    0.89
     beginnings
    0.89
     usual
    0.89
    Act Density 0.198%

    No Known Activations