INDEX
Explanations
phrases related to making choices or decisions
instances of the word "chose" or its variations
Negative Logits
Token     Logit
brance    -0.83
uum       -0.80
ijk       -0.73
peria     -0.71
loo       -0.71
igi       -0.71
aura      -0.67
bley      -0.66
emic      -0.66
ptoms     -0.65
Positive Logits
Token     Logit
wisely    0.87
chose     0.82
chosen    0.81
chooses   0.79
axe       0.77
choices   0.77
choose    0.76
choice    0.76
randomly  0.71
choosing  0.70
Activations Density: 0.038%