INDEX
    Explanations

    mentions of controversial topics or discussions

    New Auto-Interp
    Negative Logits
    ãĥ«ãĤ¯
    -0.16
    warts
    -0.14
    urally
    -0.14
    aravel
    -0.14
    addtogroup
    -0.13
     kola
    -0.13
    izoph
    -0.13
    Advertisements
    -0.13
    QRST
    -0.13
    ernote
    -0.13
    POSITIVE LOGITS
     Twitter
    0.30
     online
    0.30
     social
    0.28
     twitter
    0.27
     users
    0.25
     twe
    0.25
     reactions
    0.25
     tweets
    0.24
     posts
    0.24
     responses
    0.24
    Act Density 0.173%

    No Known Activations