Ruicong Xu scite author profile

Ruicong Xu

5Publications

48Citation Statements Received

125Citation Statements Given

How they've been cited

How they cite others

124

Affiliations

The University of Tokyo, Sun Yat-sen University, Institut Franco-Chinois de l'Energie Nucléaire

Publications

Order By: Most citations

A Proposal-Based Approach for Activity Image-to-Video Retrieval

Niu

Zhang

et al. 2020

AAAI

View full text Add to dashboard Cite

Activity image-to-video retrieval task aims to retrieve videos containing the similar activity as the query image, which is a challenging task because videos generally have many background segments irrelevant to the activity. In this paper, we utilize R-C3D model to represent a video by a bag of activity proposals, which can filter out background segments to some extent. However, there are still noisy proposals in each bag. Thus, we propose an Activity Proposal-based Image-to-Video Retrieval (APIVR) approach, which incorporates multi-instance learning into cross-modal retrieval framework to address the proposal noise issue. Specifically, we propose a Graph Multi-Instance Learning (GMIL) module with graph convolutional layer, and integrate this module with classification loss, adversarial loss, and triplet loss in our cross-modal retrieval framework. Moreover, we propose geometry-aware triplet loss based on point-to-subspace distance to preserve the structural information of activity proposals. Extensive experiments on three widely-used datasets verify the effectiveness of our approach.

show abstract

Experimental study on sloshing characteristics in a pool with stratified liquids

Cheng¹,

Xu²,

Jin³

et al. 2020

Annals of Nuclear Energy

View full text Add to dashboard Cite

Efficient Binary Coding for Subspace-based Query-by-Image Video Retrieval

Yang

Shen

et al. 2017

View full text Add to dashboard Cite

The query-by-image video retrieval (QBIVR) task has been attracting considerable research attention recently. However, most existing methods represent a video by either aggregating or projecting all its frames into a single datum point, which may easily cause severe information loss. In this paper, we propose an efficient QBIVR framework to enable an effective and efficient video search with image query. We first define a similarity-preserving distance metric between an image and its orthogonal projection in the subspace of the video, which can be equivalently transformed to a Maximum Inner Product Search (MIPS) problem. Besides, to boost the efficiency of solving the MIPS problem, we propose two asymmetric hashing schemes, which bridge the domain gap of images and videos. The first approach, termed Inner-product Binary Coding (IBC), preserves the inner relationships of images and videos in a common Hamming space. To further improve the retrieval efficiency, we devise a Bilinear Binary Coding (BBC) approach, which employs compact bilinear projections instead of a single large projection matrix. Extensive experiments have been conducted on four real-world video datasets to verify the effectiveness of our proposed approaches as compared to the state-of-the-arts.

show abstract

Activity Image-to-Video Retrieval by Disentangling Appearance and Motion

Liu

Niu

et al. 2021

AAAI

View full text Add to dashboard Cite

With the rapid emergence of video data, image-to-video retrieval has attracted much attention. There are two types of image-to-video retrieval: instance-based and activity-based. The former task aims to retrieve videos containing the same main objects as the query image, while the latter focuses on finding the similar activity. Since dynamic information plays a significant role in the video, we pay attention to the latter task to explore the motion relation between images and videos. In this paper, we propose a Motion-assisted Activity Proposal-based Image-to-Video Retrieval (MAP-IVR) approach to disentangle the video features into motion features and appearance features and obtain appearance features from the images. Then, we perform image-to-video translation to improve the disentanglement quality. The retrieval is performed in both appearance and video feature spaces. Extensive experiments demonstrate that our MAP-IVR approach remarkably outperforms the state-of-the-art approaches on two benchmark activity-based video datasets.

show abstract

Knowledge from recent investigations on sloshing motion in a liquid pool with solid particles for severe accident analyses of sodium-cooled fast reactor

Cheng

et al. 2022

Nuclear Engineering and Technology

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ruicong Xu

A Proposal-Based Approach for Activity Image-to-Video Retrieval

Experimental study on sloshing characteristics in a pool with stratified liquids

Efficient Binary Coding for Subspace-based Query-by-Image Video Retrieval

Activity Image-to-Video Retrieval by Disentangling Appearance and Motion

Knowledge from recent investigations on sloshing motion in a liquid pool with solid particles for severe accident analyses of sodium-cooled fast reactor

Contact Info

Product

Resources

About