The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2023
DOI: 10.48550/arxiv.2303.04739
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Advancing Direct Convolution using Convolution Slicing Optimization and ISA Extensions

Abstract: Convolution is one of the most computationally intensive operations that must be performed for machine-learning model inference. A traditional approach to compute convolutions is known as the Im2Col + BLAS method. This paper proposes SConv: a direct-convolution algorithm based on a MLIR/LLVM code-generation toolchain that can be integrated into machine-learning compilers . This algorithm introduces: (a) Convolution Slicing Analysis (CSA) -a convolution-specific 3D cache-blocking analysis pass that focuses on t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…While promising, LIBXSMM has two fundamental drawbacks. First, its data layout design is incompatible with the common data layouts (i.e., 𝑁𝐶𝐻𝑊 or 𝑁 𝐻𝑊 𝐶) 1 used in mainstreamed deep learning (DL) frameworks [8,13,30]. Therefore, integrating the BRGEMM routines into DL frameworks requires either code refactoring to the underlying DL framework or introducing a format conversion stage at the user code when calling and exiting each CONV operator.…”
Section: Introductionmentioning
confidence: 99%
“…While promising, LIBXSMM has two fundamental drawbacks. First, its data layout design is incompatible with the common data layouts (i.e., 𝑁𝐶𝐻𝑊 or 𝑁 𝐻𝑊 𝐶) 1 used in mainstreamed deep learning (DL) frameworks [8,13,30]. Therefore, integrating the BRGEMM routines into DL frameworks requires either code refactoring to the underlying DL framework or introducing a format conversion stage at the user code when calling and exiting each CONV operator.…”
Section: Introductionmentioning
confidence: 99%