INDEX
    Explanations

    the article "the" and its various occurrences in the text

    New Auto-Interp
    Negative Logits
    SPONSORED
    -0.98
     furthermore
    -0.80
     therefore
    -0.77
    won
    -0.73
     accordingly
    -0.71
     outwe
    -0.70
     besides
    -0.69
    NB
    -0.69
     Became
    -0.68
    zon
    -0.68
    POSITIVE LOGITS
     aforementioned
    1.04
     proverbial
    1.02
     infamous
    1.02
     latter
    1.01
     usual
    0.98
     original
    0.94
     stereotypical
    0.94
     previous
    0.89
     originals
    0.88
     classic
    0.87
    Act Density 0.150%

    No Known Activations