INDEX
Explanations
tweets mentioning images
references to social media activities, particularly related to posting images and tweets
New Auto-Interp
Negative Logits
laboratories
-0.84
Trials
-0.77
treaties
-0.76
Flavoring
-0.74
enary
-0.74
defects
-0.73
unequ
-0.72
Lect
-0.71
asbestos
-0.71
museums
-0.70
POSITIVE LOGITS
retweet
1.54
"@
1.44
hasht
1.41
Redditor
1.41
"#
1.33
1.23
1.22
hashtag
1.19
tweeted
1.19
username
1.18
Activations Density 0.819%