“…For the problems that we consider in this paper, parallel DP algorithms were already discussed in a rich literature in the eighties and nineties (e.g., [49,51,42,58,57,72]). Later work not only considers parallelism, but also optimizes symmetric cache complexity (e.g., [46,34,36,31,20,60,77,74,75,41,73,32]). Algorithms in linear algebra that share similar computation structures (but with different orders of computation) have also been discussed (e.g., [36,41,83,78,25,40,11,65]).…”