INDEX
Explanations
newsletters to sign up for
occurrences of the term "Newsletter" and similar structured text passages
New Auto-Interp
Negative Logits
come
-0.87
had
-0.78
roo
-0.76
grave
-0.70
runs
-0.68
comes
-0.68
being
-0.66
equality
-0.66
haul
-0.64
bows
-0.63
POSITIVE LOGITS
Sign
0.89
inbox
0.88
subscriptions
0.87
subscribers
0.81
Typ
0.80
Emails
0.80
subscriber
0.78
Newsletter
0.77
subscription
0.75
Subscribe
0.74
Activations Density 0.011%