INDEX
    Explanations

    Twitter links with a string representation of a Twitter.com link

    punctuation marks, specifically periods

    New Auto-Interp
    Negative Logits
     oun
    -0.94
    ortunately
    -0.89
    senal
    -0.88
     practition
    -0.88
     eleph
    -0.87
    Þ
    -0.85
    ò
    -0.84
     pione
    -0.79
    oreAnd
    -0.79
     exha
    -0.78
    POSITIVE LOGITS
    com
    1.43
    tumblr
    1.01
    wordpress
    1.01
    org
    1.00
    blogspot
    1.00
    COM
    0.96
    twitch
    0.95
    edu
    0.94
    twitter
    0.93
    github
    0.90
    Act Density 0.018%

    No Known Activations