INDEX
    Explanations

    mentions of the social media platform Twitter

    New Auto-Interp
    Negative Logits
     Twitter
    -1.36
     twitter
    -1.30
    Twitter
    -1.30
    twitter
    -1.17
     tweeted
    -1.12
     tweet
    -1.08
     TWITTER
    -1.08
     tweeting
    -1.08
     tweets
    -1.03
     Tweet
    -0.93
    POSITIVE LOGITS
     utafitiHapana
    0.65
    GeneratedCode
    0.61
     autorytatywna
    0.58
     թվական
    0.58
     ExecuteAsync
    0.57
     للاسماء
    0.57
     تضيفلها
    0.56
    0.54
     defStyle
    0.54
    Personensuche
    0.53
    Act Density 0.180%

    No Known Activations