INDEX
Explanations
covert instructions or hidden terms within messages
criteria for eligibility in competitions or promotions
New Auto-Interp
Negative Logits
ĸļ
-0.68
razen
-0.67
disadvant
-0.63
overshadow
-0.62
undermin
-0.61
ãĥ³ãĤ¸
-0.59
pher
-0.58
Ò
-0.58
behavi
-0.57
ppard
-0.57
POSITIVE LOGITS
Cancel
0.88
Submit
0.83
Login
0.79
Shipping
0.77
ORDER
0.77
Refresh
0.75
Payment
0.74
Purchase
0.74
Apply
0.74
Username
0.74
Activations Density 0.806%