INDEX
Explanations
instances of decision-making and choices
phrases about making choices or decisions
Negative Logits
metadata    -0.79
rote        -0.69
scope       -0.68
rites       -0.67
meta        -0.67
bound       -0.66
estones     -0.66
visible     -0.66
workings    -0.65
hooting     -0.65
Positive Logits
either      0.97
Either      0.93
Either      0.92
either      0.86
Option      0.78
whichever   0.78
option      0.75
choose      0.73
chose       0.73
Carbuncle   0.72
Activations Density 0.370%