INDEX
    Explanations

    proper nouns with titles or names

    instances of ending punctuation

    NEGATIVE LOGITS
    etheless      -0.75
    wcs           -0.72
     advis        -0.70
     secondly     -0.70
    translation   -0.70
     charact      -0.68
    ãĥĦ           -0.64
     ignition     -0.63
    ãĤ¨ãĥ«        -0.63
     arrang       -0.63
    POSITIVE LOGITS
     Smith        1.03
     Olson        1.00
     Miller       1.00
     Baker        1.00
     Bernstein    0.99
     Stephens     0.98
     Gors         0.98
     Ware         0.97
     Decker       0.96
     Peterson     0.96
    Act Density 0.034%

    No Known Activations