Yutong Zhang scite author profile

Yutong Zhang

3Publications

13Citation Statements Received

198Citation Statements Given

How they've been cited

How they cite others

104

198

Affiliations

Beijing Institute of Technology

Publications

Order By: Most citations

SimpleTrack: Rethinking and Improving the JDE Approach for Multi-Object Tracking

Ding

Wei

et al. 2022

Sensors

View full text Add to dashboard Cite

Joint detection and embedding (JDE) methods usually fuse the target motion information and appearance information as the data association matrix, which could fail when the target is briefly lost or blocked in multi-object tracking (MOT). In this paper, we aim to solve this problem by proposing a novel association matrix, the Embedding and GioU (EG) matrix, which combines the embedding cosine distance and GioU distance of objects. To improve the performance of data association, we develop a simple, effective, bottom-up fusion tracker for re-identity features, named SimpleTrack, and propose a new tracking strategy which can mitigate the loss of detection targets. To show the effectiveness of the proposed method, experiments are carried out using five different state-of-the-art JDE-based methods. The results show that by simply replacing the original association matrix with our EG matrix, we can achieve significant improvements in IDF1, HOTA and IDsw metrics, and increase the tracking speed of these methods by around 20%. In addition, our SimpleTrack has the best data association capability among the JDE-based methods, e.g., 61.6 HOTA and 76.3 IDF1, on the test set of MOT17 with 23 FPS running speed on a single GTX2080Ti GPU.

show abstract

A fast manhattan frame estimation method based on normal vectors

Zhang

Ding

Song

et al. 2022

Journal of Field Robotics

View full text Add to dashboard Cite

In most human made scenes, such as high-rise urban city or indoor environment, the surface normal vectors or direction vectors are concentrated in three orthogonal principal directions. The scene of such a pattern is called Manhattan World (MW), and the coordinate frame formed by the three principal directions is called Manhattan Frame (MF). MF estimation methods have been applied to many different fields, such as scene reconstruction, Visual based Simultaneous Localization And Mapping (V-SLAM) and camera calibration. In this paper, we propose a novel MF estimation method based on a set of normal vectors. A cost function of normal vectors and MF axes is introduced based on the trigonometric function. For computational purpose, the cost function is significantly simplified by making use of vector dot and cross products, and the reduced cost function only involves 14 scalar parameters that need to be computed with O(n) complexity. The experimental results show that the proposed MF estimation method has excellent real-time performance and gives high accuracy on both the virtual and real-world benchmark datasets of different sizes.

show abstract

FSD-BRIEF: A Distorted BRIEF Descriptor for Fisheye Image Based on Spherical Perspective Model

Zhang

Song

Ding

et al. 2021

Sensors

View full text Add to dashboard Cite

Fisheye images with a far larger Field of View (FOV) have severe radial distortion, with the result that the associated image feature matching process cannot achieve the best performance if the traditional feature descriptors are used. To address this challenge, this paper reports a novel distorted Binary Robust Independent Elementary Feature (BRIEF) descriptor for fisheye images based on a spherical perspective model. Firstly, the 3D gray centroid of feature points is designed, and the position and direction of the feature points on the spherical image are described by a constructed feature point attitude matrix. Then, based on the attitude matrix of feature points, the coordinate mapping relationship between the BRIEF descriptor template and the fisheye image is established to realize the computation associated with the distorted BRIEF descriptor. Four experiments are provided to test and verify the invariance and matching performance of the proposed descriptor for a fisheye image. The experimental results show that the proposed descriptor works well for distortion invariance and can significantly improve the matching performance in fisheye images.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yutong Zhang

SimpleTrack: Rethinking and Improving the JDE Approach for Multi-Object Tracking

A fast manhattan frame estimation method based on normal vectors

FSD-BRIEF: A Distorted BRIEF Descriptor for Fisheye Image Based on Spherical Perspective Model

Contact Info

Product

Resources

About