INDEX
Explanations
references to subscriptions or sign-up actions related to newsletters or emails
Negative Logits
ilts    -0.81
erness  -0.75
areth   -0.67
nesses  -0.64
ossus   -0.63
uthor   -0.63
ahime   -0.62
Tracks  -0.62
anges   -0.62
eters   -0.62
Positive Logits
inbox     0.78
totaling  0.78
roundup   0.71
dialog    0.70
ðŁ        0.69
trending  0.69
playlist  0.65
selfie    0.63
secut     0.62
0.62
Activations Density 0.007%