INDEX
Explanations
Twitter handles
mentions of Twitter handles or social media usernames
New Auto-Interp
Negative Logits
Takeru
-0.82
vessels
-0.69
outnumbered
-0.69
aspir
-0.66
unsus
-0.65
exorc
-0.65
crossover
-0.65
favors
-0.64
assum
-0.63
stricken
-0.63
POSITIVE LOGITS
username
1.01
#$
0.93
Coach
0.91
realDonaldTrump
0.90
deck
0.88
(@
0.85
Tweet
0.82
Official
0.82
nick
0.80
photos
0.79
Activations Density 0.013%