INDEX
Explanations
social media platform names
mentions of social media platforms
New Auto-Interp
Negative Logits
bilt
-0.78
jri
-0.77
rals
-0.73
èª
-0.69
schild
-0.68
plane
-0.65
ECD
-0.65
uits
-0.62
sbm
-0.61
ACTED
-0.61
POSITIVE LOGITS
0.95
0.88
Tumblr
0.86
0.85
Comments
0.84
PHOTO
0.79
Likes
0.78
0.77
0.77
ileaks
0.75
Activations Density 0.035%