INDEX
Explanations
declarations or promises of things that will not happen
negations or expressions of refusal
New Auto-Interp
Negative Logits
wrapper
-0.69
Handbook
-0.67
Measures
-0.66
Intern
-0.65
Seasons
-0.65
Nice
-0.64
Maker
-0.63
quickShipAvailable
-0.63
Strategy
-0.62
Nine
-0.62
POSITIVE LOGITS
necessarily
1.24
icably
1.20
tolerate
1.04
icable
1.03
bother
1.00
bud
0.99
hesitate
0.97
suffice
0.89
be
0.88
interfere
0.87
Activations Density 0.088%