INDEX
Explanations
keywords related to calls to action, instructions, or directives
calls to action or prompts related to obtaining information or following updates
New Auto-Interp
Negative Logits
minist
-0.73
)",
-0.71
().
-0.60
respectively
-0.60
session
-0.60
laying
-0.60
UD
-0.58
maturity
-0.58
swe
-0.57
,,,,
-0.57
POSITIVE LOGITS
Alert
0.89
Continued
0.88
SHARES
0.85
WATCHED
0.85
Featured
0.84
Correction
0.84
Expand
0.83
Slate
0.83
0.82
Follow
0.82
Activations Density 0.178%