Modern Graphics Processing Units (GPUs) require large hardware resources for massive parallel thread execution. In particular, modern GPUs have a large register file composed of Static Random Access Memory (SRAM). Due to the high leakage current of SRAM, the register file consumes approximately 20% of the total GPU energy, and its energy efficiency becomes more critical as GPU throughput increases. To build more energy-efficient GPUs, the use of non-volatile memory such as Spin-Transfer Torque Magnetic Random Access Memory (STT-MRAM) for the GPU register file has been studied extensively. STT-MRAM has a lower leakage current than SRAM and offers adequate read performance. However, using STT-MRAM directly in the GPU register file degrades both performance and endurance because of its complicated write procedure and material characteristics. To overcome these challenges, we propose Hi-End, a novel register file architecture and management scheme for GPUs that exploits the data locality and compressibility of register values. For STT-MRAM-based GPU register files, Hi-End improves write performance through caching and improves endurance through data compression. In our evaluation, Hi-End enhances the energy efficiency of the GPU register file by 70.02% and reduces write operations by up to 95.98%, with negligible performance degradation compared to an SRAM-based register file.

INDEX TERMS Graphics processing unit, register file, spin-transfer torque magnetic random access memory, data compression, energy efficiency, endurance, chip area.
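The abstract above is a summary only; the following is a minimal, hypothetical C++ sketch of the kind of write path it describes, in which a small SRAM-like cache absorbs register writes and values are compressed on eviction so that fewer STT-MRAM cells are toggled. All names (RegFileWritePath, tryCompress, the cache size, and the toy compression check) are illustrative assumptions, not the paper's actual design.

#include <cstddef>
#include <cstdint>
#include <optional>
#include <unordered_map>

struct CompressedReg {
    uint64_t payload;   // compressed bits (illustrative)
    uint8_t  width;     // number of bits actually written after compression
};

// Toy narrow-value check: registers with many leading zero bits compress well.
std::optional<CompressedReg> tryCompress(uint64_t value) {
    if ((value >> 16) == 0) return CompressedReg{value, 16};  // fits in 16 bits
    if ((value >> 32) == 0) return CompressedReg{value, 32};  // fits in 32 bits
    return std::nullopt;                                      // incompressible
}

class RegFileWritePath {
    std::unordered_map<uint32_t, uint64_t> writeCache;  // stands in for the SRAM cache
    std::size_t cacheCapacity = 256;                     // assumed capacity

public:
    uint64_t sttWriteBits = 0;  // counts bits actually written to STT-MRAM

    void write(uint32_t regId, uint64_t value) {
        // Writes are absorbed by the cache; STT-MRAM is touched only on eviction.
        if (writeCache.size() >= cacheCapacity && writeCache.count(regId) == 0)
            evictOne();
        writeCache[regId] = value;
    }

private:
    void evictOne() {
        auto victim = writeCache.begin();
        auto c = tryCompress(victim->second);
        sttWriteBits += c ? c->width : 64;  // compressed evictions write fewer bits
        writeCache.erase(victim);
    }
};

Under these assumptions, the cache hides STT-MRAM write latency from the pipeline while compression reduces the number of cell writes per eviction, which is the intuition behind the reported write-operation reduction.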
The large memory footprint of recent machine learning applications imposes a heavy burden on systems in terms of both power and processing speed. Processing-In-Memory (PIM) techniques are an alternative approach to alleviating this burden. In particular, recommendation systems, one of the major machine learning workloads in data centers, require a huge memory capacity and are therefore a good candidate for PIM acceleration. In this paper, we introduce PIMCaffe, a machine learning framework designed for in-memory neural processing units, together with its evaluation environment. PIMCaffe consists of two components: a Caffe2-based deep learning framework that supports PIM acceleration and a PIM-emulating hardware platform. We develop a suite of functions, libraries, application programming interfaces, and a device driver to support the framework. In addition, we implement a prototype Neural Processing Unit (NPU) in PIMCaffe to evaluate the performance of our platform on machine learning applications. The prototype NPU design includes a vector processor for parallel vector processing and a systolic array unit for matrix multiplication. Using the proposed software framework, we perform a detailed analysis of the in-memory neural processing unit. PIMCaffe supports the evaluation of recommendation systems and various convolutional neural network models on the in-memory NPU, and with the NPU it achieves up to 2.26x, 5.99x, and 1.71x speedup for the recommendation system, AlexNet, and ResNet-50, respectively, compared to an ARM Cortex-A53 CPU.
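As a point of reference for the systolic array unit mentioned above, the following is a minimal C++ sketch of an output-stationary systolic-style matrix multiply, the core operation such an NPU accelerates. The tile size and function names are hypothetical and not taken from PIMCaffe's actual hardware or API.

#include <array>
#include <cstddef>

constexpr std::size_t TILE = 4;  // assumed processing-element (PE) grid size

using Tile = std::array<std::array<float, TILE>, TILE>;

// Each time step, every PE multiplies the operands reaching it and accumulates
// into its local register, mimicking a wavefront of operands flowing through
// the PE grid while the partial sums stay stationary in place.
Tile systolicMatmul(const Tile& A, const Tile& B) {
    Tile C{};  // output-stationary accumulators, one per PE
    for (std::size_t step = 0; step < TILE; ++step)      // time steps
        for (std::size_t i = 0; i < TILE; ++i)           // PE row
            for (std::size_t j = 0; j < TILE; ++j)        // PE column
                C[i][j] += A[i][step] * B[step][j];
    return C;
}

In a real PIM setting, tiles like these would be multiplied near the memory arrays, so the framework's job is mainly to partition layer weights and activations into such tiles and schedule them onto the array.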