INDEX
    Explanations

    the word "a" followed by a word with a positive connotation

    instances of the article "a" indicating various uses or contexts

    Negative Logits
        Token               Logit
        " unrelated"        -0.63
        " establishments"   -0.60
        " interests"        -0.60
        " orally"           -0.60
        " IDs"              -0.60
        " events"           -0.59
        " unpublished"      -0.59
        " rates"            -0.59
        " Advanced"         -0.59
        " influential"      -0.58
    Positive Logits
        Token        Logit
        " tad"       1.40
        " bit"       1.31
        "flame"      1.10
        " little"    1.07
        "kward"      1.04
        " lot"       1.02
        "jar"        1.02
        " breeze"    0.99
        "versive"    0.97
        "gh"         0.96
    Activation Density: 0.161%

    No Known Activations