Morfometria dos canais e anéis inguinais de fetos natimortos e cadáveres adultos humanos e sua relação com as hérnias inguinais

In this paper, we describe scalable parallel algorithms for sparse matrix factorization, analyze their performance and scalability, and present experimental results for up to 1024 processors on a Cray T3D parallel computer. Through our analysis and experimental results, we demonstrate that our algorithms substantially improve the state of the art in parallel direct solution of sparse linear systems-both in terms of scalability and overall performance. It is a well known fact that dense matrix factorization scales well and can be implemented efficiently on parallel computers. In this paper, we present the first algorithms to factor a wide class of sparse matrices (including those arising from two-and three-dimensional finite element problems) that are asymptotically as scalable as dense matrix factorization algorithms on a variety of parallel architectures. Our algorithms incur less communication overhead and are more scalable than any previously known parallel formulation of sparse matrix factorization. Although, in this paper, we discuss Cholesky factorization of symmetric positive definite matrices, the algorithms can be adapted for solving sparse linear least squares problems and for Gaussian elimination of diagonally dominant matrices that are almost symmetric in structure. An implementation of one of our sparse Cholesky factorization algorithms delivers up to 20 GFlops on a Cray T3D for medium-size structural engineering and linear programming problems. To the best of our knowledge, this is the highest performance ever obtained for sparse Cholesky factorization on any supercomputer.

show abstract

Revisiting Asynchronous Linear Solvers: Provable Convergence Rate through Randomization

Avron

Druinsky

Gupta

2014

View full text Add to dashboard Cite

Asynchronous methods for solving systems of linear equations have been researched since Chazan and Miranker's pioneering 1969 paper on chaotic relaxation. The underlying idea of asynchronous methods is to avoid processor idle time by allowing the processors to continue to make progress even if not all progress made by other processors has been communicated to them.Historically, the applicability of asynchronous methods for solving linear equations was limited to certain restricted classes of matrices, such as diagonally dominant matrices. Furthermore, analysis of these methods focused on proving convergence in the limit. Comparison of the asynchronous convergence rate with its synchronous counterpart and its scaling with the number of processors were seldom studied, and are still not well understood.In this paper, we propose a randomized shared-memory asynchronous method for general symmetric positive definite matrices. We rigorously analyze the convergence rate and prove that it is linear, and is close to that of the method's synchronous counterpart if the processor count is not excessive relative to the size and sparsity of the matrix. We also present an algorithm for unsymmetric systems and overdetermined least-squares. Our work presents a significant improvement in the applicability of asynchronous linear solvers as well as in their convergence analysis, and suggests randomization as a key paradigm to serve as a foundation for asynchronous methods.

show abstract

Recent advances in direct methods for solving unsymmetric sparse systems of linear equations

Gupta

2002

ACM Trans. Math. Softw.

View full text Add to dashboard Cite

During the past few years, algorithmic improvements alone have reduced the time required for the direct solution of unsymmetric sparse systems of linear equations by almost an order of magnitude. This paper compares the performance of some well-known software packages for solving general sparse systems. In particular, it demonstrates the consistently high level of performance achieved by WSMP-the most recent of such solvers. It compares the various algorithmic components of these solvers and discusses their impact on solver performance. Our experiments show that the algorithmic choices made in WSMP enable it to run more than twice as fast as the best among similar solvers and that WSMP can factor some of the largest sparse matrices available from real applications in only a few seconds on a 4-CPU workstation. Thus, the combination of advances in hardware and algorithms makes it possible to solve those general sparse linear systems quickly and easily that might have been considered too large until recently.

show abstract

Fast and effective algorithms for graph partitioning and sparse-matrix ordering

Gupta

1997

IBM J. Res. & Dev.

View full text Add to dashboard Cite

Graph partitioning is a fundamental problem in several scientific and engineering applications. In this paper, we describe heuristics that improve the state-of-the-art practical algorithms used in graph-partitioning software in terms of both partitioning speed and quality. An important use of graph partitioning is in ordering sparse matrices for obtaining direct solutions to sparse systems of linear equations arising in engineering and optimization applications. The experiments reported in this paper show that the use of these heuristics results in a considerable improvement in the quality of sparse-matrix orderings over conventional ordering methods, especially for sparse matrices arising in linear programming problems. In addition, our graph-partitioningbased ordering algorithm is more parallelizable than minimum-degree-based ordering algorithms, and it renders the ordered matrix more amenable to parallel factorization.

show abstract

Analyzing Scalability of Parallel Algorithms and Architectures

Kumar¹,

Gupta²

1994

Journal of Parallel and Distributed Computing

135

View full text Add to dashboard Cite

Improved Symbolic and Numerical Factorization Algorithms for Unsymmetric Sparse Matrices

Gupta¹

2002

SIAM J. Matrix Anal. & Appl.

View full text Add to dashboard Cite

We present algorithms for the symbolic and numerical factorization phases in the direct solution of sparse unsymmetric systems of linear equations. We have modi ed a classical symbolic factorization algorithm for unsymmetric matrices to inexpensively compute minimal elimination structures. We give an e cient algorithm to compute a near-minimal data-dependency graph that is valid irrespective of the amount of dynamic pivoting performed during numerical factorization. Finally, we describe an unsymmetric-pattern multifrontal algorithm for Gaussian elimination with partial pivoting that uses the task-and data-dependency graphs computed during the symbolic phase. These algorithms have been implemented in WSMP|an industrial strength sparse solver package|and have enabled WSMP to signi cantly outperform other similar solvers. We present experimental results to demonstrate the merits of the new algorithms.

show abstract

Adaptive Techniques for Improving the Performance of Incomplete Factorization Preconditioning

Gupta¹,

George²

2010

SIAM J. Sci. Comput.

View full text Add to dashboard Cite

Scalable Parallel Algorithms for Unstructured Problems

Kumar

Grama

Gupta

et al. 1995

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Anshul Gupta

Highly scalable parallel algorithms for sparse matrix factorization

Revisiting Asynchronous Linear Solvers: Provable Convergence Rate through Randomization

Recent advances in direct methods for solving unsymmetric sparse systems of linear equations

Fast and effective algorithms for graph partitioning and sparse-matrix ordering

Analyzing Scalability of Parallel Algorithms and Architectures

Improved Symbolic and Numerical Factorization Algorithms for Unsymmetric Sparse Matrices

Adaptive Techniques for Improving the Performance of Incomplete Factorization Preconditioning

Scalable Parallel Algorithms for Unstructured Problems

Contact Info

Product

Resources

About