The growing computational capacity of DNNs has achieved unprecedented success in areas such as machine vision and natural language processing (NLP); for example, recent Transformers have billions of parameters. However, because large-scale DNNs significantly exceed a GPU's physical memory limit, they cannot be trained by conventional methods such as data parallelism. Pipeline parallelism, which partitions a large DNN into small subnets and trains them on different GPUs, is a plausible solution. Unfortunately, the layer partitioning and memory management in existing pipeline-parallel systems are fixed during training, leaving them prone to out-of-memory errors and GPU under-utilization. These drawbacks are amplified when performing neural architecture search (NAS), as in the Evolved Transformer, where different Transformer architectures need to be trained repeatedly. VPIPE is the first system that transparently provides dynamic layer partitioning and memory management for pipeline parallelism. VPIPE makes two unique contributions: (1) an online algorithm for searching a near-optimal layer partitioning and memory management plan, and (2) a live layer migration protocol for rebalancing the layer distribution across a training pipeline. VPIPE improved the training throughput of two notable baselines (PipeDream and GPipe) by 61.4%-463.4% and 24.8%-291.3%, respectively, on various large DNNs and training settings.
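The layer-partitioning problem the abstract describes can be made concrete with a minimal sketch: given per-layer compute costs, split the layer sequence into contiguous pipeline stages so the slowest (bottleneck) stage is as cheap as possible. This is an illustrative dynamic program, not VPIPE's actual online algorithm; the function and variable names are assumptions.

```python
# Illustrative sketch (not VPIPE's algorithm): partition a layer
# sequence into `stages` contiguous groups, minimizing the maximum
# (bottleneck) stage cost via dynamic programming.
def partition(costs, stages):
    n = len(costs)
    prefix = [0.0] * (n + 1)
    for i, c in enumerate(costs):
        prefix[i + 1] = prefix[i] + c
    INF = float("inf")
    # best[k][i]: minimal bottleneck cost covering the first i layers
    # with k stages; cut[k][i] remembers where the last stage begins.
    best = [[INF] * (n + 1) for _ in range(stages + 1)]
    cut = [[0] * (n + 1) for _ in range(stages + 1)]
    best[0][0] = 0.0
    for k in range(1, stages + 1):
        for i in range(k, n + 1):
            for j in range(k - 1, i):
                cand = max(best[k - 1][j], prefix[i] - prefix[j])
                if cand < best[k][i]:
                    best[k][i], cut[k][i] = cand, j
    # Recover stage boundaries as (start, end) layer indices.
    bounds, i = [], n
    for k in range(stages, 0, -1):
        j = cut[k][i]
        bounds.append((j, i))
        i = j
    return best[stages][n], bounds[::-1]
```

For example, `partition([1, 2, 3, 4], 2)` yields bottleneck cost `6` with stages `[(0, 3), (3, 4)]`. A *dynamic* system like VPIPE must re-solve this kind of balancing problem as layer costs and memory footprints change during training, rather than fixing the plan once.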
Supernet training, a prevalent and important paradigm in neural architecture search, embeds the whole DNN architecture search space into one monolithic supernet, iteratively activates a subset of the supernet (i.e., a subnet) to fit each batch of data, and searches for a high-quality subnet that meets specific requirements. Although training subnets in parallel on multiple GPUs is desirable for acceleration, there is an inherent race hazard: concurrent subnets may access the same DNN layers. Existing systems neither efficiently parallelize subnets' training executions nor resolve the race hazard deterministically, leading to unreproducible training procedures and potentially non-trivial accuracy loss. We present NASPipe, the first high-performance and reproducible distributed supernet training system, built on a causal synchronous parallel (CSP) pipeline scheduling abstraction: NASPipe partitions a supernet across GPUs and concurrently executes multiple generated sub-tasks (subnets) in a pipelined manner; meanwhile, it oversees the correlations between the subnets and deterministically resolves any causal dependency caused by subnets' layer sharing. To obtain high performance, NASPipe's CSP scheduler exploits the fact that the larger a supernet spans, the fewer dependencies manifest between chronologically close subnets; therefore, it aggressively schedules subnets with larger chronological orders into execution, provided they are not causally dependent on unfinished subnets with smaller chronological orders.
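The causal-dependency condition at the heart of CSP scheduling can be sketched as a simple check: a subnet may run ahead of its chronological turn only if it shares no layers with any unfinished, chronologically earlier subnet. This is an illustrative sketch of the idea, not NASPipe's implementation; the function name and data layout are assumptions.

```python
# Illustrative sketch (not NASPipe's code): a subnet can be scheduled
# out of chronological order only if it shares no layers with any
# unfinished subnet of smaller chronological order.
def schedulable(subnet_layers, candidate_id, unfinished_ids):
    """subnet_layers: dict mapping subnet id -> set of layer names."""
    cand = set(subnet_layers[candidate_id])
    for uid in unfinished_ids:
        if uid < candidate_id and cand & set(subnet_layers[uid]):
            return False  # causal dependency via a shared layer
    return True
```

For instance, with `subnet_layers = {0: {"a", "b"}, 1: {"c"}, 2: {"a"}}` and subnet 0 still unfinished, subnet 1 can be scheduled ahead (disjoint layers) while subnet 2 cannot (it shares layer `"a"`). This matches the abstract's observation: the larger the supernet, the less likely chronologically close subnets overlap, so aggressive out-of-order scheduling succeeds more often.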
To effectively reduce the volume and weight of traditional optical telescopes, this article proposes an electro-optical imaging technology based on a microlens array and a fiber interferometer. Pairs of microlenses in the array collect light and couple it into a fiber interferometer to form interference fringes. The amplitudes and phases of a large number of interferometric baselines are then analyzed to generate images. In this work, the principle of the electro-optical imaging technology is analyzed according to partially coherent light theory, and the microlens-array arrangement and baseline pairing methods are optimized for arbitrary targets. The simulation results show that the imaging resolution depends on the maximum baseline length, and that the imaging quality can be effectively improved by adjusting the Nyquist sampling density and the baseline pairing method. This technology can provide an important reference for the miniaturization and flattening of imaging systems.
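The claim that imaging resolution depends on the maximum baseline length mirrors the classic diffraction limit, with the longest baseline playing the role of an aperture diameter: theta ≈ 1.22 λ / B_max (in radians). The sketch below is a generic illustration of that relation, not this article's derivation; the wavelength and baseline values are assumptions.

```python
# Illustrative diffraction-limit sketch: angular resolution set by the
# maximum baseline length, analogous to an aperture of the same size.
def angular_resolution(wavelength_m, max_baseline_m):
    """Return theta ~ 1.22 * lambda / B_max, in radians."""
    return 1.22 * wavelength_m / max_baseline_m

# Assumed example: 550 nm visible light, 10 cm maximum baseline.
theta = angular_resolution(550e-9, 0.1)  # ~6.7e-6 rad
```

Doubling the maximum baseline halves theta, which is why a flat microlens-array interferometer can trade a bulky telescope aperture for a sufficiently long baseline span.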