INDEX
Explanations
mentions of tweets
occurrences of the word "tweet"
New Auto-Interp
Negative Logits
CLUD
-0.66
Gent
-0.64
Myth
-0.64
vantage
-0.63
Brill
-0.62
Starr
-0.61
inav
-0.60
Somers
-0.60
Valkyrie
-0.60
Newport
-0.59
POSITIVE LOGITS
storms
1.14
Tweet
1.02
storm
0.99
Tweet
0.99
tweets
0.93
hashtag
0.90
tweet
0.89
weet
0.87
deck
0.87
hasht
0.85
Activations Density 0.018%