INDEX
    Explanations

    social media handles and posts

    New Auto-Interp
    Negative Logits
    cised
    -0.68
     CFR
    -0.68
    heny
    -0.66
     displacement
    -0.63
     Dracula
    -0.63
    rugged
    -0.63
     Quart
    -0.62
    enary
    -0.62
     Stevens
    -0.61
    zik
    -0.61
    POSITIVE LOGITS
    username
    1.12
     Username
    1.10
     username
    1.05
     postings
    0.94
    pages
    0.92
     chats
    0.91
    Reddit
    0.88
     user
    0.88
     feeds
    0.88
     hashtag
    0.88
    Act Density 3.981%

    No Known Activations