INDEX
Explanations
numerical references and counts
New Auto-Interp
Negative Logits
Thousands
-0.18
BOTH
-0.17
both
-0.17
Numerous
-0.17
thousands
-0.16
åIJĦç§į
-0.16
Various
-0.15
amen
-0.15
許
-0.15
éĤ£äºĽ
-0.15
POSITIVE LOGITS
different
0.32
dozen
0.29
separate
0.28
sets
0.28
/all
0.26
different
0.26
-thirds
0.25
teenth
0.25
of
0.25
seperate
0.25
Activations Density 0.294%