Wei Peng scite author profile

Learning Graph Convolutional Network for Skeleton-Based Human Action Recognition by Neural Searching

Human action recognition from skeleton data, fuelled by the Graph Convolutional Network (GCN) with its powerful capability of modeling non-Euclidean data, has attracted considerable attention. However, many existing GCNs provide a pre-defined graph structure and share it through the entire network, which can lose implicit joint correlations, especially for the higher-level features. Besides, the mainstream spectral GCN is approximated by a first-order hop, so higher-order connections are not well captured. Addressing these issues by hand requires substantial effort to design a better GCN architecture. To address these problems, we turn to Neural Architecture Search (NAS) and propose the first automatically designed GCN for this task. Specifically, we explore the spatial-temporal correlations between nodes and build a search space with multiple dynamic graph modules. Besides, we introduce multiple-hop modules, aiming to break the limitation of representational capacity caused by the first-order approximation. Moreover, a corresponding sampling- and memory-efficient evolution strategy is proposed to search in this space. The resulting architecture demonstrates the effectiveness of the higher-order approximation and the layer-wise dynamic graph modules. To evaluate the performance of the searched model, we conduct extensive experiments on two very large-scale skeleton-based action recognition datasets. The results show that our model achieves state-of-the-art results in terms of the given metrics.

Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement

Peng

et al. 2019

210

163

Remote photoplethysmography (rPPG), which aims at measuring heart activities without any contact, has great potential in many applications (e.g., remote healthcare). Existing rPPG approaches rely on analyzing very fine details of facial videos, which are easily degraded by video compression. Here we propose a two-stage, end-to-end method using hidden rPPG information enhancement and attention networks, which is the first attempt to counter video compression loss and recover rPPG signals from highly compressed videos. The method includes two parts: 1) a Spatio-Temporal Video Enhancement Network (STVEN) for video enhancement, and 2) an rPPG network (rPPGNet) for rPPG signal recovery. The rPPGNet can work on its own for robust rPPG measurement, and the STVEN network can be added and jointly trained to further boost the performance, especially on highly compressed videos. Comprehensive experiments are performed on two benchmark datasets to show that the proposed method 1) achieves superior performance on compressed videos with paired high-quality videos, and 2) also generalizes well to novel data with only compressed videos available, which implies promising potential for real-world applications.

On the equivalence between Non-negative Matrix Factorization and Probabilistic Latent Semantic Indexing

Ding

Peng

2008

Computational Statistics & Data Analysis

219

143

Non-negative Matrix Factorization (NMF) and Probabilistic Latent Semantic Indexing (PLSI) have been successfully applied to document clustering recently. In this paper, we show that PLSI and NMF (with the I-divergence objective function) optimize the same objective function, although PLSI and NMF are different algorithms, as verified by experiments. This provides a theoretical basis for a new hybrid method that runs PLSI and NMF alternately, each jumping out of the local minima of the other method in turn, thus achieving a better final solution. Extensive experiments on five real-life datasets show relations between NMF and PLSI, and indicate that the hybrid method leads to significant improvements over NMF-only or PLSI-only methods. We also show that at first-order approximation, NMF is identical to the χ²-statistic.
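
To make the claimed equivalence concrete, the two objectives can be written side by side. This is only a sketch in standard notation, not taken from the abstract: the symbols are assumptions introduced here, with $X$ the word-document co-occurrence matrix normalized so that $\sum_{i,j} X_{ij} = 1$, $F$ and $G$ the nonnegative NMF factors, and $z_k$ the PLSI latent topics.

```latex
% NMF with the I-divergence objective (minimized over F, G >= 0)
J_{\mathrm{NMF}} = \sum_{i,j} \left[ X_{ij} \log \frac{X_{ij}}{(FG^{\top})_{ij}} - X_{ij} + (FG^{\top})_{ij} \right]

% PLSI log-likelihood (maximized over the conditional probabilities)
J_{\mathrm{PLSI}} = \sum_{i,j} X_{ij} \log P(w_i, d_j),
\qquad
P(w_i, d_j) = \sum_{k} P(w_i \mid z_k)\, P(z_k)\, P(d_j \mid z_k)

% Identifying (FG^{\top})_{ij} with P(w_i, d_j), so that \sum_{i,j} (FG^{\top})_{ij} = 1,
% the linear terms cancel and only a constant separates the two objectives:
J_{\mathrm{NMF}} = \sum_{i,j} X_{ij} \log X_{ij} - J_{\mathrm{PLSI}}
```

Under these normalization assumptions, minimizing $J_{\mathrm{NMF}}$ and maximizing $J_{\mathrm{PLSI}}$ target the same function, even though the update rules of the two algorithms differ, which is consistent with the abstract's point that they can reach different local solutions.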

Revealing the Invisible With Model and Data Shrinking for Composite-Database Micro-Expression Recognition

Xia

Peng

Khor

et al. 2020

IEEE Trans. on Image Process.

Composite-database micro-expression recognition is attracting increasing attention as it is more practical for real-world applications. Though the composite database provides more sample diversity for learning good representation models, the important subtle dynamics are prone to disappearing under the domain shift, so model performance degrades greatly, especially for deep models. In this paper, we analyze the influence of learning complexity, including the input complexity and model complexity, and discover that lower-resolution input data and shallower-architecture models help ease the degradation of deep models in the composite-database task. Based on this, we propose a recurrent convolutional network (RCN) to explore the shallower architecture and lower-resolution input data, shrinking model and input complexities simultaneously. Furthermore, we develop three parameter-free modules (i.e., wide expansion, shortcut connection and attention unit) to integrate with RCN without increasing any learnable parameters. These three modules can enhance the representation ability from various perspectives while preserving a not-very-deep architecture for lower-resolution data. Besides, the three modules can further be combined by an automatic strategy (a neural architecture search strategy), and the searched architecture becomes more robust. Extensive experiments on the MEGC2019 dataset (composed of the existing SMIC, CASME II and SAMM datasets) have verified the influence of learning complexity and shown that RCNs with the three modules and the searched combination outperform the state-of-the-art approaches.

Hyperbolic Deep Neural Networks: A Survey

Peng

Varanka

Mostafa

et al. 2022

IEEE Trans. Pattern Anal. Mach. Intell.

Recently, there has been a rising surge of momentum for deep representation learning in hyperbolic spaces due to their high capacity for modeling data with hierarchical structure, such as knowledge graphs or synonym hierarchies. We refer to it as a hyperbolic deep neural network in this paper. Such a hyperbolic neural architecture potentially leads to drastically more compact models with much more physical interpretability than its counterpart in Euclidean space. To stimulate future research, this paper presents a coherent and comprehensive review of the literature around the neural components in the construction of hyperbolic deep neural networks, as well as the generalization of the leading deep approaches to the hyperbolic space. It also presents current applications around various machine learning tasks on several publicly available datasets, together with insightful observations, open questions, and promising future directions.

Predicting collective sentiment dynamics from time-series social media

Nguyen

Chan

et al. 2012

Social network user influence sense-making and dynamics prediction

Peng

et al. 2014

Expert Systems with Applications

Wei Peng

Orthogonal nonnegative matrix t-factorizations for clustering

Learning Graph Convolutional Network for Skeleton-Based Human Action Recognition by Neural Searching
