2017 IEEE Custom Integrated Circuits Conference (CICC) 2017
DOI: 10.1109/cicc.2017.7993626
|View full text |Cite
|
Sign up to set email alerts
|

Hardware for machine learning: Challenges and opportunities

Abstract: Abstract-Machine learning plays a critical role in extracting meaningful information out of the zetabytes of sensor data collected every day. For some applications, the goal is to analyze and understand the data to identify trends (e.g., surveillance, portable/wearable electronics); in other applications, the goal is to take immediate action based the data (e.g., robotics/drones, self-driving cars, smart Internet of Things). For many of these applications, local embedded processing near the sensor is preferred… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
111
0
6

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 198 publications
(134 citation statements)
references
References 60 publications
0
111
0
6
Order By: Relevance
“…There exists a severe contradiction between the complex model and the limited computational resources. Although at present, a large amount of dedicated hardware emerges for deep learning [16,17,18,19,20], providing efficient vector operations to enable fast convolution in forward inference, From the aspect of explainable machine learning, we can summarize that some filters are playing a similar role in the model, especially when the model size is large. So it is reasonable to prune some useless filters or reduce their precision to lower bits.…”
Section: Introductionmentioning
confidence: 99%
“…There exists a severe contradiction between the complex model and the limited computational resources. Although at present, a large amount of dedicated hardware emerges for deep learning [16,17,18,19,20], providing efficient vector operations to enable fast convolution in forward inference, From the aspect of explainable machine learning, we can summarize that some filters are playing a similar role in the model, especially when the model size is large. So it is reasonable to prune some useless filters or reduce their precision to lower bits.…”
Section: Introductionmentioning
confidence: 99%
“…The proposed algorithm was written in C++, QVGA resolution frames (320×240 pixels) are processed and the maximum RAM usage is 1.6 GB. Following recent trends and suggestions of other authors, which claim a jointly software and hardware design, the results of the present work focus on the frame rate and power consumption of the algorithm, which are hardware dependent [13], [45].…”
Section: Resultsmentioning
confidence: 99%
“…Consumption is one of the main limiting factors for deep learning application on embedded IoT end-nodes. The opportunities that CNNs offer for image processing are undeniable but their practical use and expansion will highly depend on the hardware design and the development of hardwareoriented algorithms [45]. The energy consumption of CNNs is dominated by data movement instead of the computation itself [13].…”
Section: Power Consumptionmentioning
confidence: 99%
“…• Improvement in learning efficiency: Efficiency of learning could include some aspects such as the number of required parameters, memory or storage requirements, computational time and training convergence rate. Obviously, fewer but optimal parameters and lower memory requirements are desirable in deploying such algorithms on conventional devices such as mobile phones and PCs [59].…”
Section: Joint Tasks Via Multitask Learningmentioning
confidence: 99%