INDEX
    Explanations

    occurrences of social media posts

    instances of the word "posted."

    New Auto-Interp
    Negative Logits
    inho
    -0.73
    idge
    -0.71
    isma
    -0.71
    icably
    -0.68
    ppo
    -0.66
    hart
    -0.65
     Galile
    -0.64
    phant
    -0.64
    pter
    -0.64
     Gand
    -0.63
    POSITIVE LOGITS
     posting
    0.91
    posted
    0.91
    uploads
    0.90
     postings
    0.86
    gres
    0.83
     posts
    0.82
    ulate
    0.81
    doctoral
    0.81
    hum
    0.78
     posted
    0.77
    Act Density 0.025%

    No Known Activations