INDEX
    Explanations

    web addresses and email addresses

    sentences ending with a period

    New Auto-Interp
    Negative Logits
     basis
    -0.64
     architectural
    -0.64
     attraction
    -0.62
    ħĭ
    -0.61
     sadly
    -0.60
     backward
    -0.60
     backwards
    -0.59
     trophy
    -0.59
    onga
    -0.58
     cod
    -0.58
    POSITIVE LOGITS
    edu
    1.05
    com
    0.93
    tv
    0.90
    tumblr
    0.83
    twitter
    0.82
    blogspot
    0.82
    php
    0.81
    co
    0.80
    push
    0.80
    net
    0.79
    Act Density 0.111%

    No Known Activations