INDEX
Explanations
phrases related to making choices or decisions
various forms of the verb "opt," indicating choices or preferences
New Auto-Interp
Negative Logits
nces
-0.68
borough
-0.65
esville
-0.64
flies
-0.63
Effective
-0.62
Danger
-0.61
rival
-0.61
capacity
-0.60
riage
-0.59
Famous
-0.58
POSITIVE LOGITS
opt
0.95
opting
0.88
nir
0.88
opted
0.86
atis
0.83
uary
0.83
atory
0.81
atively
0.79
wisely
0.74
aye
0.73
Activations Density 0.033%