INDEX
Explanations
plateau or crest-related terms
terms related to stability and levels in various contexts
New Auto-Interp
Negative Logits
portation
-0.75
alez
-0.69
RH
-0.64
ilies
-0.60
att
-0.58
cas
-0.58
opal
-0.58
unal
-0.58
olly
-0.58
ĪĴ
-0.58
POSITIVE LOGITS
plateau
1.38
peaks
0.90
fall
0.76
edIn
0.76
Wem
0.73
crest
0.71
zx
0.71
=-=-
0.70
hene
0.68
Archdemon
0.68
Activations Density 0.004%