Huabin Liu scite author profile

Huabin Liu

5Publications

34Citation Statements Received

159Citation Statements Given

How they've been cited

How they cite others

192

159

Affiliations

Shanghai Jiao Tong University

Publications

Order By: Most citations

TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition

Liu

Qian

et al. 2022

AAAI

View full text Add to dashboard Cite

Few-shot action recognition aims to recognize novel action classes (query) using just a few samples (support). The majority of current approaches follow the metric learning paradigm, which learns to compare the similarity between videos. Recently, it has been observed that directly measuring this similarity is not ideal since different action instances may show distinctive temporal distribution, resulting in severe misalignment issues across query and support videos. In this paper, we arrest this problem from two distinct aspects -- action duration misalignment and action evolution misalignment. We address them sequentially through a Two-stage Action Alignment Network (TA2N). The first stage locates the action by learning a temporal affine transform, which warps each video feature to its action duration while dismissing the action-irrelevant feature (e.g. background). Next, the second stage coordinates query feature to match the spatial-temporal action evolution of support by performing temporally rearrange and spatially offset prediction. Extensive experiments on benchmark datasets show the potential of the proposed method in achieving state-of-the-art performance for few-shot action recognition.

show abstract

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization

Qian

Liu

et al. 2021

View full text Add to dashboard Cite

Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition

Liu

et al. 2022

IEEE Trans. Multimedia

View full text Add to dashboard Cite

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization

Qian

Liu

et al. 2021

Preprint

View full text Add to dashboard Cite

The crux of self-supervised video representation learning is to build general features from unlabeled videos. However, most recent works have mainly focused on high-level semantics and neglected lower-level representations and their temporal relationship which are crucial for general video understanding. To address these challenges, this paper proposes a multi-level feature optimization framework to improve the generalization and temporal modeling ability of learned video representations. Concretely, high-level features obtained from naive and prototypical contrastive learning are utilized to build distribution graphs, guiding the process of low-level and mid-level feature learning. We also devise a simple temporal modeling module from multi-level features to enhance motion pattern learning. Experiments demonstrate that multi-level feature optimization with the graph constraint and temporal modeling can greatly improve the representation ability in video understanding. Code is available here.

show abstract

Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition

Liu

See

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Huabin Liu

TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization

Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition

Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization

Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition

Contact Info

Product

Resources

About