INDEX
Explanations
words related to urging or encouraging actions
phrases that direct or urge action towards specific subjects
New Auto-Interp
Negative Logits
_.
-0.71
Detected
-0.68
traumatic
-0.67
holes
-0.65
Appears
-0.65
washer
-0.62
closed
-0.60
Yep
-0.60
quickShipAvailable
-0.60
cloth
-0.59
POSITIVE LOGITS
avoid
1.03
participate
0.98
undertake
0.97
minimize
0.93
keep
0.93
reconsider
0.92
emulate
0.92
learn
0.91
maximize
0.91
embrace
0.91
Activations Density 0.210%