INDEX
    Explanations

    online social interactions or engagement related terms

    references to social media sharing and related interactions

    New Auto-Interp
    Negative Logits
     Greenberg
    -0.71
     unaccount
    -0.62
     inexplicable
    -0.62
     Dyn
    -0.61
     lining
    -0.60
     Zak
    -0.59
     undisclosed
    -0.58
     Starr
    -0.58
     manif
    -0.58
     sealing
    -0.57
    POSITIVE LOGITS
    Share
    0.96
    ãĤ¨ãĥ«
    0.93
    advertising
    0.76
    Pinterest
    0.75
    Rate
    0.72
     Share
    0.72
    eria
    0.72
    Spread
    0.71
    lesh
    0.71
    Reddit
    0.69
    Act Density 0.065%

    No Known Activations