INDEX
Explanations
words related to decision-making
terms related to decision-making processes
New Auto-Interp
Negative Logits
amina
-0.71
bing
-0.70
aired
-0.64
awar
-0.64
ateur
-0.63
bid
-0.63
Dak
-0.63
izont
-0.63
oby
-0.63
vae
-0.63
POSITIVE LOGITS
decisions
1.08
makers
0.81
choices
0.80
maker
0.74
wisely
0.73
regarding
0.72
affecting
0.72
ACTIONS
0.71
decision
0.71
involving
0.68
Activations Density 0.035%