INDEX
Explanations
email subscription-related phrases
phrases related to email subscriptions for news updates
New Auto-Interp
Negative Logits
lihood
-0.76
Classification
-0.61
derog
-0.58
clarification
-0.58
Malf
-0.57
BSD
-0.57
MpServer
-0.57
Figure
-0.56
beit
-0.56
dracon
-0.56
POSITIVE LOGITS
subscribe
0.86
letters
0.84
Subscribe
0.83
interstitial
0.81
Subscribe
0.80
alerts
0.78
inbox
0.75
rss
0.71
subscription
0.70
subscribers
0.70
Activations Density 0.119%