2012 45th Annual IEEE/ACM International Symposium on Microarchitecture
DOI: 10.1109/micro.2012.48
Neural Acceleration for General-Purpose Approximate Programs

Abstract: This paper describes a learning-based approach to the acceleration of approximate programs. We describe the Parrot transformation, a program transformation that selects and trains a neural network to mimic a region of imperative code. After the learning phase, the compiler replaces the original code with an invocation of a low-power accelerator called a neural processing unit (NPU). The NPU is tightly coupled to the processor pipeline to accelerate small code regions. Since neural networks inherently produce a…
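The Parrot transformation described in the abstract has three steps: observe the inputs and outputs of an approximable code region, train a small neural network to mimic it, and replace the region with an invocation of the trained network. A minimal sketch of that flow is below; the function names (`target_region`, `npu_invoke`), network size, and training details are illustrative assumptions, not the paper's actual compiler or NPU design.

```python
import numpy as np

# Hypothetical "approximable region": a small pure function that the
# compiler would outline and mimic (illustrative, not from the paper).
def target_region(x):
    return x * x  # stand-in for an imperative hot loop

rng = np.random.default_rng(0)

# Step 1: observe input/output pairs of the region (the learning phase).
X = rng.uniform(0.0, 1.0, size=(512, 1))
Y = target_region(X)

# Step 2: train a tiny 1-8-1 MLP (tanh hidden layer, linear output)
# with full-batch gradient descent on mean-squared error.
W1 = rng.normal(0, 0.5, (1, 8)); b1 = np.zeros(8)
W2 = rng.normal(0, 0.5, (8, 1)); b2 = np.zeros(1)
lr = 0.1
for _ in range(2000):
    H = np.tanh(X @ W1 + b1)                  # hidden activations
    P = H @ W2 + b2                           # network prediction
    err = P - Y                               # gradient of 0.5 * MSE
    gW2 = H.T @ err / len(X); gb2 = err.mean(0)
    dH = (err @ W2.T) * (1 - H * H)           # backprop through tanh
    gW1 = X.T @ dH / len(X); gb1 = dH.mean(0)
    W1 -= lr * gW1; b1 -= lr * gb1
    W2 -= lr * gW2; b2 -= lr * gb2

# Step 3: the transformed program now calls the network instead of
# the original region (standing in for the NPU invocation).
def npu_invoke(x):
    h = np.tanh(np.atleast_2d(x) @ W1 + b1)
    return (h @ W2 + b2).item()

mse = float(np.mean((np.tanh(X @ W1 + b1) @ W2 + b2 - Y) ** 2))
print(f"MSE of mimic vs. exact region: {mse:.5f}")
```

The sketch captures the key trade-off the paper exploits: the network's answer is only approximately equal to the original region's, which is acceptable precisely because the region was selected as error-tolerant.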


Cited by 524 publications (304 citation statements)
References 34 publications
“…One possible reason is that many past studies have used cores that did not trigger some errors we observed. For example, older and simpler cores like Atom and Penryn have lower issue widths, so studies using them [13,18,42,51,55] avoid read/write port overestimates, one of the major error sources we observed. Penryn cores also do not support SMT, eliminating the duplication of hardware error.…”
Section: Discussion and Guidelines
confidence: 97%
“…Recently, there has been significant interest in non-traditional acceleration platforms, such as approximate computing designs, some of which have targeted the computer vision space. In [13], a dynamically trainable neural network accelerator is trained on regions of image processing algorithms and replicates their output with a high degree of accuracy and significant efficiency gains compared with a general purpose processor. In [30] voltage-coupled oscillators are used to build pattern-matching operators that in turn perform image processing tasks.…”
Section: Related Work
confidence: 99%
“…First, we note that many high performance applications, such as the "Recognition, Mining, and Synthesis (RMS)" workload from Intel [21], are based on heuristics that can be approximated (e.g. with the use of neural networks [24]). Second, applications that use exact computation also often include regions of computation that are tolerant to imprecision, or "approximable", even if these regions can only be circumstantially approximated (e.g.…”
Section: Introduction
confidence: 99%