INDEX
Explanations
instances where the concept of "choice" is mentioned
references to the concept of choice in various contexts
New Auto-Interp
Negative Logits
uum
-0.86
velop
-0.74
vae
-0.74
awar
-0.72
itz
-0.72
zona
-0.71
steen
-0.71
monton
-0.71
peria
-0.71
brance
-0.70
POSITIVE LOGITS
choices
1.09
choice
1.01
axe
0.85
options
0.78
Altern
0.78
chose
0.78
Choice
0.77
Choice
0.76
é¾įå¥ij士
0.76
ACTIONS
0.75
Activations Density 0.028%