“…A number of library generators automatically produce high-performance kernels, including FFT [13,29,36], BLAS [34,37,5,14,16], sparse numerical computation [19,33,26,4,23], and domain-specific routines [3,7,24]. Recent research [21,22,17] extends automatic code generation to routines whose performance depends not only on architectural features but also on input characteristics.…”