INDEX
Explanations
phrases related to decision making and choices
Negative Logits
brance     -0.88
ptoms      -0.77
ijk        -0.71
oun        -0.71
igi        -0.68
qv         -0.67
peria      -0.66
emic       -0.66
aura       -0.65
issance    -0.64
Positive Logits
wisely     0.88
chose      0.82
chooses    0.79
chosen     0.77
randomly   0.75
choose     0.75
choice     0.73
choosing   0.73
choices    0.66
axe        0.66
Activations Density 0.042%
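
As a rough illustration of what a density figure like this usually means, the sketch below computes the percentage of token positions on which a feature's activation is above zero. This is an assumption about the metric, not a statement of how this dashboard computes it; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. The source page does not define the metric.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 10,000 token positions, 4 of which activate the feature.
sample = [0.0] * 9996 + [1.3, 0.4, 2.1, 0.9]
print(f"{activation_density(sample):.3f}%")  # prints 0.040%
```

Under this reading, a density of 0.042% would correspond to the feature firing on roughly 4 of every 10,000 sampled tokens.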