INDEX
Explanations
numerical comparisons and measurements related to quantities or thresholds
New Auto-Interp
Negative Logits
igos
-0.16
antan
-0.15
]={↵-0.15
alf
-0.14
sto
-0.14
lec
-0.14
/md
-0.14
æħ¶
-0.14
ãģ¤ãģ¶
-0.13
.trace
-0.13
POSITIVE LOGITS
Abel
0.16
stered
0.15
than
0.15
šek
0.14
ADER
0.14
compact
0.14
ãĥ«
0.14
.rnn
0.14
unity
0.14
zech
0.14
Activations Density 0.156%