Multiple Object Tracking (MOT) plays an important role in solving many fundamental problems in video analysis and computer vision. Most MOT methods employ two steps: Object Detection and Data Association. The first step detects objects of interest in every frame of a video, and the second establishes correspondence between the detected objects in different frames to obtain their tracks. Object detection has made tremendous progress in the last few years due to deep learning. However, data association for tracking still relies on hand crafted constraints such as appearance, motion, spatial proximity, grouping etc. to compute affinities between the objects in different frames. In this paper, we harness the power of deep learning for data association in tracking by jointly modeling object appearances and their affinities between different frames in an end-to-end fashion. The proposed Deep Affinity Network (DAN) learns compact, yet comprehensive features of pre-detected objects at several levels of abstraction, and performs exhaustive pairing permutations of those features in any two frames to infer object affinities. DAN also accounts for multiple objects appearing and disappearing between video frames. We exploit the resulting efficient affinity computations to associate objects in the current frame deep into the previous frames for reliable on-line tracking. Our technique is evaluated on popular multiple object tracking challenges MOT15, MOT17 and UA-DETRAC. Comprehensive benchmarking under twelve evaluation metrics demonstrates that our approach is among the best performing techniques on the leader board for these challenges. The open source implementation of our work is available at https://github.com/shijieS/SST.git.
Intelligent vehicle detection and counting are becoming increasingly important in the field of highway management. However, due to the different sizes of vehicles, their detection remains a challenge that directly affects the accuracy of vehicle counts. To address this issue, this paper proposes a vision-based vehicle detection and counting system. A new high definition highway vehicle dataset with a total of 57,290 annotated instances in 11,129 images is published in this study. Compared with the existing public datasets, the proposed dataset contains annotated tiny objects in the image, which provides the complete data foundation for vehicle detection based on deep learning. In the proposed vehicle detection and counting system, the highway road surface in the image is first extracted and divided into a remote area and a proximal area by a newly proposed segmentation method; the method is crucial for improving vehicle detection. Then, the above two areas are placed into the YOLOv3 network to detect the type and location of the vehicle. Finally, the vehicle trajectories are obtained by the ORB algorithm, which can be used to judge the driving direction of the vehicle and obtain the number of different vehicles. Several highway surveillance videos based on different scenes are used to verify the proposed methods. The experimental results verify that using the proposed segmentation method can provide higher detection accuracy, especially for the detection of small vehicle objects. Moreover, the novel strategy described in this article performs notably well in judging driving direction and counting vehicles. This paper has general practical significance for the management and control of highway scenes.
The continuous development in the construction of transportation infrastructure has brought enormous pressure to traffic control. Accurate and detailed traffic flow information is valuable for an effective traffic control strategy. This paper proposes a video-based vehicle counting framework using a three-component process of object detection, object tracking, and trajectory processing to obtain the traffic flow information. First, a dataset for vehicle object detection (VDD) and a standard dataset for verifying the vehicle counting results (VCD) were established. The object detection was then completed by deep learning with VDD. Using this detection, a matching algorithm was designed to perform multi-object tracking in combination with a traditional tracking method. Trajectories of the moving objects were obtained using this approach. Finally, a trajectory counting algorithm based on encoding is proposed. The vehicles were counted according to the vehicle categories and their moving route to obtain detailed traffic flow information. The results demonstrated that the overall accuracy of our method for vehicle counting can reach more than 90%. The running rate of the proposed framework is 20.7 frames/s on the VCD. Therefore, the proposed vehicle counting framework is capable of acquiring reliable traffic flow information, which is likely applicable to intelligent traffic control and dynamic signal timing. INDEX TERMS Object detection, object tracking, trajectory processing, vehicle counting.
This paper devotes to the image compression and encryption problems. We develop a novel hybrid scheme based on block compressive sensing. Concentrate on taking full advantage of the different frequency coefficients sparsity, the nonuniform sampling strategy is adopted to improve the compression efficiency. First, the discrete cosine transform coefficients matrices of blocks are transformed into vectors by zigzag scanning. The different frequency components are extracted in the front, middle, and back of vectors, respectively. Using the measurement matrices with different dimensions, the combination of lowand high-frequency components, together with the medium-frequency coefficients are compressed simultaneously. Second, the recombinational block measurements are re-encrypted by the permutation-diffusion framework. The logistic map is introduced for key stream generation. In order to accomplish a sensitive and effective cryptosystem, the control strategy for secret keys is employed. The simulation results indicate that the proposed scheme forms a high balance between reconstruction performance, storage and computational complexity, and hardware implementation. Moreover, the security analyses demonstrate the satisfactory performance and effectiveness of the proposed cryptosystem. The scheme can work efficiently in the parallel computing environment, especially for the images with medium and large size. INDEX TERMS Block compressive sensing, image cryptosystem, logistic map, nonuniform sampling strategy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.