INDEX
Explanations
terms related to decision-making
references to decisions and judgment-making processes
NEGATIVE LOGITS
uum       -0.73
agogue    -0.67
hibit     -0.66
itute     -0.64
strip     -0.64
esc       -0.63
ew        -0.62
porter    -0.61
chant     -0.60
uu        -0.60
POSITIVE LOGITS
decisions      3.67
decision       2.31
choices        2.26
judgments      2.10
rulings        1.85
Decision       1.74
conclusions    1.61
mistakes       1.58
actions        1.49
opinions       1.48
Activations Density: 0.016%