2024
DOI: 10.1145/3607870
|View full text |Cite
|
Sign up to set email alerts
|

SMaLL: Software for Rapidly Instantiating Machine Learning Libraries

Abstract: Interest in deploying deep neural network (DNN) inference on edge devices has resulted in an explosion of the number and types of hardware platforms that machine learning (ML) libraries must support. High-level programming interfaces, such as TensorFlow, can be readily ported across different devices; however, maintaining performance when porting the low-level implementation is more nuanced. High-performance inference implementations require an effective mapping of the high-level interface to the target hardwa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 9 publications
0
1
0
Order By: Relevance
“…Specialized data layouts. Many prior works have sought to optimize convolution operations by introducing specialized data formats that allow for continuous memory accesses and direct use of SIMD instructions and FMA units [24,31,61,67,69]. These approaches have demonstrated promising convolution performance 4.…”
Section: Related Workmentioning
confidence: 99%
“…Specialized data layouts. Many prior works have sought to optimize convolution operations by introducing specialized data formats that allow for continuous memory accesses and direct use of SIMD instructions and FMA units [24,31,61,67,69]. These approaches have demonstrated promising convolution performance 4.…”
Section: Related Workmentioning
confidence: 99%