INDEX
    Explanations

    mentions or discussions related to social media

    New Auto-Interp
    Negative Logits
    ARD
    -0.64
    nces
    -0.63
     recount
    -0.61
     stump
    -0.61
    arty
    -0.61
     retri
    -0.59
     Miko
    -0.59
    orious
    -0.59
    ELL
    -0.58
    ebin
    -0.58
    POSITIVE LOGITS
    itional
    0.99
    itions
    0.94
    icate
    0.93
    ition
    0.90
    ical
    0.87
    bots
    0.83
    nob
    0.82
    icians
    0.81
    igun
    0.81
    itism
    0.80
    Act Density 5.229%

    No Known Activations