“…Matrix is reordered to help forming blocks around the diagonal and off-diagonal to improve the locality. 9,10,[30][31][32]34 In Reference 9, authors use dense BLAS operations to compute dense off-diagonal blocks. Dense matrix computations have higher parallelism than their sparse counterparts, therefore, creating dense blocks in sparse matrices is an effective method to improve performance.…”
Section: Related Workmentioning
confidence: 99%
“…It is a method practiced on CPUs more than on GPUs. Matrix is reordered to help forming blocks around the diagonal and off‐diagonal to improve the locality 9,10,30‐32,34 . In Reference 9, authors use dense BLAS operations to compute dense off‐diagonal blocks.…”
Section: Related Workmentioning
confidence: 99%
“…Different approaches have been taken to attack this problem which are based on partitioning the matrix into smaller pieces and solving them with the appropriate method and/or on the appropriate architecture. “Block‐diagonal based methods” is an umbrella term which is used for such partitioning methods for SpTRSV on CPUs 6 . Partitioning approach is a natural approach for SpTRSV and it has many successful examples in the literature 7‐12 .…”
SummarySparse triangular solve (SpTRSV) is an extensively studied computational kernel. An important obstacle in parallel SpTRSV implementations is that in some parts of a sparse matrix the computation is serial. By transforming the dependency graph, it is possible to increase the parallelism of the parts that lack it. In this work, we present a novel graph transformation strategy to increase the parallelism degree of a sparse matrix and compare it to our previous strategy. It is seen that our transformation strategy can provide a speedup as high as .
“…Matrix is reordered to help forming blocks around the diagonal and off-diagonal to improve the locality. 9,10,[30][31][32]34 In Reference 9, authors use dense BLAS operations to compute dense off-diagonal blocks. Dense matrix computations have higher parallelism than their sparse counterparts, therefore, creating dense blocks in sparse matrices is an effective method to improve performance.…”
Section: Related Workmentioning
confidence: 99%
“…It is a method practiced on CPUs more than on GPUs. Matrix is reordered to help forming blocks around the diagonal and off‐diagonal to improve the locality 9,10,30‐32,34 . In Reference 9, authors use dense BLAS operations to compute dense off‐diagonal blocks.…”
Section: Related Workmentioning
confidence: 99%
“…Different approaches have been taken to attack this problem which are based on partitioning the matrix into smaller pieces and solving them with the appropriate method and/or on the appropriate architecture. “Block‐diagonal based methods” is an umbrella term which is used for such partitioning methods for SpTRSV on CPUs 6 . Partitioning approach is a natural approach for SpTRSV and it has many successful examples in the literature 7‐12 .…”
SummarySparse triangular solve (SpTRSV) is an extensively studied computational kernel. An important obstacle in parallel SpTRSV implementations is that in some parts of a sparse matrix the computation is serial. By transforming the dependency graph, it is possible to increase the parallelism of the parts that lack it. In this work, we present a novel graph transformation strategy to increase the parallelism degree of a sparse matrix and compare it to our previous strategy. It is seen that our transformation strategy can provide a speedup as high as .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.