INDEX
Explanations
mentions of social media posts, specifically the act of tweeting
instances of the word "tweeted" and its variations, indicating social media activity
Negative Logits
comprom             -0.79
cised               -0.69
inventoryQuantity   -0.67
vantage             -0.64
cius                -0.61
glomer              -0.61
ãĥĺ                 -0.61
vernment            -0.61
ãĥ´                 -0.60
ricular             -0.60
Positive Logits
"@                1.12
condolences       0.98
"#                0.94
congratulations   0.94
encouragement     0.85
pics              0.84
sarcast           0.83
angrily           0.83
screenshots       0.83
photos            0.82
Activations Density 0.053%