In this paper, we present a novel unsupervised video summarization model that requires no manual annotation. The proposed model, termed Cycle-SUM, adopts a new cycle-consistent adversarial LSTM architecture that effectively maximizes the information preservation and compactness of the summary video. It consists of a frame selector and a cycle-consistent-learning-based evaluator. The selector is a bidirectional LSTM network that learns video representations embedding the long-range relationships among video frames. The evaluator defines a learnable information-preserving metric between the original video and the summary video and "supervises" the selector to identify the most informative frames for the summary video. In particular, the evaluator is composed of two generative adversarial networks (GANs): the forward GAN learns to reconstruct the original video from the summary video, while the backward GAN learns to invert this process. The consistency between the outputs of this cycle learning is adopted as the information-preserving metric for video summarization. We demonstrate the close relation between mutual-information maximization and this cycle-learning procedure. Experiments on two video summarization benchmark datasets validate the state-of-the-art performance and superiority of the Cycle-SUM model over previous baselines.
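The cycle-consistency metric described above can be sketched as follows. This is a minimal illustration under simplifying assumptions, not the paper's implementation: `selector_mask`, the identity stand-in generators, and the L2 round-trip gap are all hypothetical, whereas the actual Cycle-SUM evaluator trains a forward and a backward GAN.

```python
import numpy as np

def cycle_consistency_loss(frames, selector_mask, forward_g, backward_g):
    """Hypothetical sketch of an information-preserving metric: the summary
    is pushed through a forward generator (summary -> video) and a backward
    generator (video -> summary); the mean squared gap between the summary
    features and their round-trip reconstruction serves as the loss."""
    summary = frames * selector_mask[:, None]               # selected frame features
    reconstructed_video = forward_g(summary)                # forward generator
    reconstructed_summary = backward_g(reconstructed_video) # backward generator
    return float(np.mean((summary - reconstructed_summary) ** 2))

# Toy usage: identity functions stand in for trained generators,
# so the round trip is perfect and the loss is exactly zero.
frames = np.random.default_rng(0).normal(size=(6, 4))
mask = np.array([1.0, 0.0, 1.0, 0.0, 1.0, 0.0])
loss = cycle_consistency_loss(frames, mask, lambda x: x, lambda x: x)
```

With trained generators the loss would be positive, and minimizing it pushes the selector toward frames from which the whole video can be reconstructed.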
Recent neural network models have significantly advanced the task of coreference resolution. However, current neural coreference models are typically trained with heuristic loss functions computed over a sequence of local decisions. In this paper, we introduce an end-to-end reinforcement-learning-based coreference resolution model that directly optimizes coreference evaluation metrics. Specifically, we modify the state-of-the-art higher-order mention-ranking approach of Lee et al. (2018) into a reinforced policy-gradient model by incorporating the reward associated with a sequence of coreference linking actions. Furthermore, we introduce maximum-entropy regularization for adequate exploration, preventing the model from prematurely converging to a bad local optimum. Our proposed model achieves new state-of-the-art performance on the English OntoNotes v5.0 benchmark.
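The reinforced objective with entropy regularization can be sketched as below. This is a minimal REINFORCE-style surrogate under assumed inputs (per-step action distributions and a scalar trajectory reward), not the paper's exact formulation; the function name and `beta` coefficient are hypothetical.

```python
import numpy as np

def reinforce_loss_with_entropy(action_dists, chosen, reward, beta=0.01):
    """Hypothetical sketch: the policy-gradient term scales the
    log-probabilities of the chosen linking actions by the trajectory
    reward (e.g. a coreference metric score), while a maximum-entropy
    bonus discourages premature collapse to a deterministic policy."""
    log_chosen = np.log([dist[a] for dist, a in zip(action_dists, chosen)])
    pg_term = -reward * log_chosen.sum()                   # REINFORCE surrogate
    entropy = -sum(np.sum(d * np.log(d)) for d in action_dists)
    return pg_term - beta * entropy                        # quantity to minimize

# Toy usage: two linking steps with small action spaces.
dists = [np.array([0.5, 0.5]), np.array([0.25, 0.75])]
chosen = [0, 1]
loss = reinforce_loss_with_entropy(dists, chosen, reward=1.0, beta=0.01)
```

A larger `beta` keeps the action distributions closer to uniform for longer, trading off exploitation of the current reward signal against exploration.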
Adult image and video recognition is an important and challenging problem in the real world. Low-level feature cues alone do not provide sufficiently discriminative information, especially when the dataset is very large and spans diverse data distributions, which poses a serious problem for conventional approaches. In this article, we tackle this problem by proposing a deep multicontext network with a fine-to-coarse strategy for adult image and video recognition. We employ deep convolutional networks to model fused features of sensitive objects in images. Global and local contexts are both taken into consideration and jointly modeled in a unified multicontext deep learning framework. To make the model more discriminative for diverse target objects, we investigate a novel hierarchical method and design a task-specific fine-to-coarse strategy that makes the multicontext modeling more suitable for adult object recognition. Furthermore, we investigate several recently proposed deep models. Our approach is extensively evaluated on four different datasets: one is used for ablation experiments, and the others for generalization experiments. Results show significant and consistent improvements over state-of-the-art methods.
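The fine-to-coarse idea can be sketched as a simple score roll-up. This is an illustrative sketch only: the class names, the grouping dictionary, and the max aggregation are assumptions, not details from the article.

```python
def fine_to_coarse(fine_scores, coarse_groups):
    """Hypothetical sketch of a fine-to-coarse strategy: fine-grained
    sensitive-object scores are produced per class, then aggregated into
    coarse labels by taking the max score over each coarse group."""
    return {coarse: max(fine_scores[name] for name in members)
            for coarse, members in coarse_groups.items()}

# Toy usage: two fine-grained sensitive-object classes roll up to "adult".
scores = {"object_a": 0.9, "object_b": 0.2, "background": 0.1}
coarse = fine_to_coarse(scores, {"adult": ["object_a", "object_b"],
                                 "normal": ["background"]})
```

Training at the fine level first lets the network discriminate among visually diverse target objects before the coarse decision is made.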