INDEX
    Explanations

    the word "The" at the beginning of sentences

    instances of the word "the" and phrases indicating problems or issues

    Negative Logits
     Dunham      -0.70
     Mons        -0.64
    Redditor     -0.61
     Rez         -0.57
    apego        -0.56
     Quote       -0.55
    sever        -0.54
    hon          -0.54
     Eleven      -0.54
    ammy         -0.54
    Positive Logits
    Catalog      0.62
    esa          0.60
    cms          0.58
    YC           0.57
    xia          0.56
     Flavoring   0.55
    ESA          0.55
    sci          0.54
    irts         0.54
    thouse       0.54
    Activation Density: 0.041%

    No Known Activations