INDEX
    Explanations

    Twitter handles to follow

    references to social media, particularly Twitter

    New Auto-Interp
    Negative Logits
    raints
    -0.78
    etheless
    -0.65
     forgiven
    -0.64
    Cooldown
    -0.62
    bably
    -0.62
    forced
    -0.60
     Meaning
    -0.59
    venge
    -0.58
     antidepressants
    -0.57
    stood
    -0.57
    POSITIVE LOGITS
     (@
    0.96
     @
    0.91
     twitter
    0.88
    edin
    0.86
     âĿ
    0.84
     hasht
    0.78
     Twitter
    0.78
    Tweet
    0.76
     Tweet
    0.75
    facebook
    0.75
    Act Density 0.044%

    No Known Activations