INDEX
    Explanations

    social media handles or usernames

    references to specific individuals or accounts on social media

    Negative Logits
    terday        -0.65
     Majesty      -0.65
     phyl         -0.63
     bragging     -0.62
     unrecogn     -0.61
    naire         -0.61
    naires        -0.60
     cass         -0.60
     intangible   -0.59
     stomp        -0.59
    Positive Logits
    iframe        0.95
    News          0.82
    Politics      0.78
    Report        0.76
    Subscribe     0.73
    Brow          0.73
    Follow        0.72
    sports        0.72
    Monitor       0.71
    Latest        0.69
    Activation Density: 0.179%

    No Known Activations