INDEX
Explanations
phrases related to having multiple choices or alternatives
references to variety and availability in choices or options
New Auto-Interp
Negative Logits
bug
-0.76
awar
-0.69
weight
-0.66
roy
-0.66
pub
-0.65
wig
-0.63
master
-0.62
ardy
-0.62
tein
-0.62
weights
-0.61
POSITIVE LOGITS
options
1.27
ensical
1.12
choices
1.02
options
0.90
alternatives
0.88
Options
0.86
atives
0.86
olutions
0.85
pring
0.84
etting
0.79
Activations Density 0.046%