INDEX
    Explanations

    social media platform names specifically Facebook

    New Auto-Interp
    Negative Logits
    bilt
    -0.74
    rals
    -0.73
    plane
    -0.71
    ACTED
    -0.68
    ppa
    -0.66
    tti
    -0.63
    akin
    -0.63
    rans
    -0.62
    gran
    -0.62
    stood
    -0.62
    POSITIVE LOGITS
     Twitter
    1.10
    Twitter
    0.92
     Pinterest
    0.85
     Comments
    0.85
     Tweet
    0.82
    Tumblr
    0.79
    twitter
    0.76
     Email
    0.76
     Likes
    0.76
    LinkedIn
    0.73
    Act Density 0.040%

    No Known Activations