INDEX
Explanations
social media platform names
mentions of popular social media platforms
New Auto-Interp
Negative Logits
~~~~~~~~~~~~~~~~
-0.74
emate
-0.67
nants
-0.67
ibus
-0.66
ajor
-0.65
lass
-0.64
onies
-0.64
zin
-0.63
20439
-0.62
cised
-0.62
POSITIVE LOGITS
Analytics
0.90
Messenger
0.86
Users
0.86
Labs
0.86
users
0.84
Gate
0.81
Bot
0.78
Browser
0.77
API
0.77
user
0.76
Activations Density 0.084%