INDEX
Explanations
phrases related to decision-making
mentions of decision-making processes
New Auto-Interp
Negative Logits
anti
-0.80
gel
-0.68
icas
-0.67
amen
-0.66
oil
-0.63
Versions
-0.63
ewitness
-0.63
unity
-0.62
iday
-0.61
ppo
-0.60
POSITIVE LOGITS
decisions
1.05
makers
0.90
regarding
0.86
maker
0.86
decision
0.79
whether
0.77
ACTIONS
0.77
concerning
0.76
affecting
0.75
ptions
0.74
Activations Density 0.042%