Explanations
questions and discussions about compromise and simplicity in decision-making
Negative Logits
alon      -0.16
liž       -0.15
nonnull   -0.15
_compat   -0.15
òng       -0.14
compl     -0.14
alus      -0.14
yles      -0.14
jeme      -0.14
anford    -0.14
Positive Logits
simply       0.54
Simply       0.43
simplement   0.42
Simply       0.40
просто       0.37
just         0.34
einfach      0.33
그냥         0.30
直接         0.30
simples      0.29
Activations Density 0.214%