INDEX
    Explanations

    specific phrases containing the word "the."

    repeated instances of the word "the."

    New Auto-Interp
    Negative Logits
    Joined
    -0.81
    lier
    -0.76
    eday
    -0.76
    eus
    -0.71
    fulness
    -0.70
    places
    -0.70
    packs
    -0.70
    autions
    -0.69
    acing
    -0.69
    fights
    -0.66
    POSITIVE LOGITS
     dreaded
    0.99
     aforementioned
    0.92
     latter
    0.91
     requisite
    0.88
     sexes
    0.85
     elusive
    0.81
     Racial
    0.71
     beloved
    0.70
     fetus
    0.70
     desired
    0.69
    Act Density 0.421%

    No Known Activations