INDEX
Explanations
social media handles or usernames
references to specific individuals or accounts on social media
Negative Logits
terday        -0.65
Majesty       -0.65
phyl          -0.63
bragging      -0.62
unrecogn      -0.61
naire         -0.61
naires        -0.60
cass          -0.60
intangible    -0.59
stomp         -0.59
Positive Logits
iframe        0.95
News          0.82
Politics      0.78
Report        0.76
Subscribe     0.73
Brow          0.73
Follow        0.72
sports        0.72
Monitor       0.71
Latest        0.69
Activations Density: 0.179%