INDEX
    Explanations

    mentions of tweets

    occurrences of the word "tweet"

    New Auto-Interp
    Negative Logits
    CLUD
    -0.66
     Gent
    -0.64
     Myth
    -0.64
    vantage
    -0.63
     Brill
    -0.62
     Starr
    -0.61
    inav
    -0.60
     Somers
    -0.60
     Valkyrie
    -0.60
     Newport
    -0.59
    POSITIVE LOGITS
    storms
    1.14
    Tweet
    1.02
    storm
    0.99
     Tweet
    0.99
     tweets
    0.93
     hashtag
    0.90
     tweet
    0.89
    weet
    0.87
    deck
    0.87
     hasht
    0.85
    Act Density 0.018%

    No Known Activations