Explanations
occurrences of mathematical symbols or notation
mathematical and scientific notation
Negative Logits
betweenstory      -0.60
Lugo              -0.56
Meksiku           -0.56
bookstore         -0.55
ondissement       -0.55
Soria             -0.55
AssemblyCompany   -0.54
ویکیآمباردا       -0.54
ambique           -0.54
rrggbb            -0.54
Positive Logits
\       1.01
\       0.77
)\      0.70
<bos>   0.68
}\      0.64
#\      0.63
]\      0.62
))\     0.59
&\      0.59
$\      0.58
Activations Density 0.218%