INDEX
Explanations
words related to decision-making and actions related to decision outcomes
New Auto-Interp
Negative Logits
soDeliveryDate
-0.65
reports
-0.64
breaks
-0.64
ups
-0.62
fortunately
-0.61
listed
-0.61
indications
-0.57
ergus
-0.57
cases
-0.57
tips
-0.56
POSITIVE LOGITS
invoke
1.02
seek
1.01
engage
1.00
maximize
0.98
communicate
0.95
promote
0.95
achieve
0.93
raise
0.93
pursue
0.93
explore
0.92
Activations Density 9.692%