Ming Yan scite author profile

Finding a fixed point to a nonexpansive operator, i.e., x * = T x * , abstracts many problems in numerical linear algebra, optimization, and other areas of scientific computing. To solve fixed-point problems, we propose ARock, an algorithmic framework in which multiple agents (machines, processors, or cores) update x in an asynchronous parallel fashion. Asynchrony is crucial to parallel computing since it reduces synchronization wait, relaxes communication bottleneck, and thus speeds up computing significantly. At each step of ARock, an agent updates a randomly selected coordinate x i based on possibly out-of-date information on x. The agents share x through either global memory or communication. If writing x i is atomic, the agents can read and write x without memory locks.Theoretically, we show that if the nonexpansive operator T has a fixed point, then with probability one, ARock generates a sequence that converges to a fixed points of T . Our conditions on T and step sizes are weaker than comparable work. Linear convergence is also obtained.We propose special cases of ARock for linear systems, convex optimization, machine learning, as well as distributed and decentralized consensus problems. Numerical experiments of solving sparse logistic regression problems are presented.

show abstract

A Decentralized Proximal-Gradient Method With Network Independent Step-Sizes and Separated Convergence Rates

Shi

Yan

2019

IEEE Trans. Signal Process.

193

187

View full text Add to dashboard Cite

This paper considers the problem of decentralized optimization with a composite objective containing smooth and non-smooth terms. To solve the problem, a proximal-gradient scheme is studied. Specifically, the smooth and nonsmooth terms are dealt with by gradient update and proximal update, respectively. The studied algorithm is closely related to a previous decentralized optimization algorithm, PG-EXTRA [37], but has a few advantages. First of all, in our new scheme, agents use uncoordinated step-sizes and the stable upper bounds on step-sizes are independent from network topologies. The step-sizes depend on local objective functions, and they can be as large as that of the gradient descent. Secondly, for the special case without non-smooth terms, linear convergence can be achieved under the strong convexity assumption. The dependence of the convergence rate on the objective functions and the network are separated, and the convergence rate of our new scheme is as good as one of the two convergence rates that match the typical rates for the general gradient descent and the consensus averaging. We also provide some numerical experiments to demonstrate the efficacy of the introduced algorithms and validate our theoretical discoveries. 1 We believe that agent i's instantaneous estimation on the optimal solution is not a piece of sensitive information but the functions s i and r i are.

show abstract

Restoration of Images Corrupted by Impulse Noise and Mixed Gaussian Impulse Noise Using Blind Inpainting

Yan¹

2013

SIAM J. Imaging Sci.

143

140

View full text Add to dashboard Cite

Abstract. This article studies the problem of image restoration of observed images corrupted by impulse noise and mixed Gaussian impulse noise. Since the pixels damaged by impulse noise contain no information about the true image, how to find this set correctly is a very important problem. We propose two methods based on blind inpainting and 0 minimization that can simultaneously find the damaged pixels and restore the image. By iteratively restoring the image and updating the set of damaged pixels, these methods have better performance than other methods, as shown in the experiments. In addition, we provide convergence analysis for these methods, these algorithms will converge to coordinatewise minimum points. In addition, they will converge to local minimum points (or with probability one) with some modifications in the algorithms.

show abstract

Robust 1-bit Compressive Sensing Using Adaptive Outlier Pursuit

Yan

Yang

Osher

2012

IEEE Trans. Signal Process.

159

119

View full text Add to dashboard Cite

A New Primal–Dual Algorithm for Minimizing the Sum of Three Functions with a Linear Operator

Yan

2018

J Sci Comput

View full text Add to dashboard Cite

In this paper, we propose a new primal-dual algorithm for minimizing f (x) + g(x) + h(Ax), where f , g, and h are proper lower semi-continuous convex functions, f is differentiable with a Lipschitz continuous gradient, and A is a bounded linear operator. The proposed algorithm has some famous primal-dual algorithms for minimizing the sum of two functions as special cases. E.g., it reduces to the Chambolle-Pock algorithm when f = 0 and the proximal alternating predictor-corrector when g = 0. For the general convex case, we prove the convergence of this new algorithm in terms of the distance to a fixed point by showing that the iteration is a nonexpansive operator. In addition, we prove the O(1/k) ergodic convergence rate in the primal-dual gap. With additional assumptions, we derive the linear convergence rate in terms of the distance to the fixed point. Comparing to other primal-dual algorithms for solving the same problem, this algorithm extends the range of acceptable parameters to ensure its convergence and has a smaller per-iteration cost. The numerical experiments show the efficiency of this algorithm.

show abstract

Fast L1–L2 Minimization via a Proximal Operator

Lou¹,

Yan

2017

J Sci Comput

145

View full text Add to dashboard Cite

This paper aims to develop new and fast algorithms for recovering a sparse vector from a small number of measurements, which is a fundamental problem in the field of compressive sensing (CS). Currently, CS favors incoherent systems, in which any two measurements are as little correlated as possible. In reality, however, many problems are coherent, and conventional methods such as L 1 minimization do not work well. Recently, the difference of the L 1 and L 2 norms, denoted as L 1 -L 2 , is shown to have superior performance over the classic L 1 method, but it is computationally expensive. We derive an analytical solution for the proximal operator of the L 1 -L 2 metric, and it makes some fast L 1 solvers such as forward-backward splitting (FBS) and alternating direction method of multipliers (ADMM) applicable for L 1 -L 2 . We describe in details how to incorporate the proximal operator into FBS and ADMM and show that the resulting algorithms are convergent under mild conditions. Both algorithms are shown to be much more efficient than the original implementation of L 1 -L 2 based on a difference-of-convex approach in the numerical experiments.Keywords Compressive sensing · proximal operator · forward-backward splitting · alternating direction method of multipliers · difference-of-convex Mathematics Subject Classification (2000) 90C26 · 65K10 · 49M29

show abstract

Parallel and distributed sparse optimization

Peng

Yan

Yin

2013

View full text Add to dashboard Cite

This paper proposes parallel and distributed algorithms for solving very large-scale sparse optimization problems on computer clusters and clouds. Modern datasets usually have a large number of features or training samples, and they are usually stored in a distributed manner. Motivated by the need of solving sparse optimization problems with large datasets, we propose two approaches including (i) distributed implementations of prox-linear algorithms and (ii) GRock, a parallel greedy block coordinate descent method. Different separability properties of the objective terms in the problem enable different data distributed schemes along with their corresponding algorithm implementations. We also establish the convergence of GRock and explain why it often performs exceptionally well for sparse optimization. Numerical results on a computer cluster and Amazon EC2 demonstrate the efficiency and elasticity of our algorithms.

show abstract

Multi-Granularity Hierarchical Attention Fusion Networks for Reading Comprehension and Question Answering

Wang¹,

Yan²,

Wu³

2018

135

View full text Add to dashboard Cite

This paper describes a novel hierarchical attention network for reading comprehension style question answering, which aims to answer questions for a given narrative paragraph. In the proposed method, attention and fusion are conducted horizontally and vertically across layers at different levels of granularity between question and paragraph. Specifically, it first encode the question and paragraph with fine-grained language embeddings, to better capture the respective representations at semantic level. Then it proposes a multi-granularity fusion approach to fully fuse information from both global and attended representations. Finally, it introduces a hierarchical attention network to focuses on the answer span progressively with multi-level softalignment. Extensive experiments on the large-scale SQuAD and TriviaQA datasets validate the effectiveness of the proposed method. At the time of writing the paper (Jan. 12th 2018), our model achieves the first position on the SQuAD leaderboard for both single and ensemble models. We also achieves state-of-the-art results on TriviaQA, AddSent and AddOne-Sent datasets.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ming Yan

ARock: An Algorithmic Framework for Asynchronous Parallel Coordinate Updates

A Decentralized Proximal-Gradient Method With Network Independent Step-Sizes and Separated Convergence Rates

Restoration of Images Corrupted by Impulse Noise and Mixed Gaussian Impulse Noise Using Blind Inpainting

Robust 1-bit Compressive Sensing Using Adaptive Outlier Pursuit

A New Primal–Dual Algorithm for Minimizing the Sum of Three Functions with a Linear Operator

Fast L1–L2 Minimization via a Proximal Operator

Parallel and distributed sparse optimization

Multi-Granularity Hierarchical Attention Fusion Networks for Reading Comprehension and Question Answering

Contact Info

Product

Resources

About