INDEX
Explanations
verbs followed by the word "to" indicating an action or intention
phrases indicating desires or intentions
New Auto-Interp
Negative Logits
Orders
-0.71
Lines
-0.70
Measures
-0.62
iple
-0.62
Directions
-0.61
enforcement
-0.61
requires
-0.61
cases
-0.60
quickShipAvailable
-0.59
words
-0.59
POSITIVE LOGITS
stay
1.08
be
1.06
keep
1.00
give
0.93
emulate
0.89
participate
0.88
take
0.88
make
0.87
avoid
0.86
perform
0.86
Activations Density 0.117%