Target tracking technology that is based on aerial videos is widely used in many fields; however, this technology has challenges, such as image jitter, target blur, high data dimensionality, and large changes in the target scale. In this paper, the research status of aerial video tracking and the characteristics, background complexity and tracking diversity of aerial video targets are summarized. Based on the findings, the key technologies that are related to tracking are elaborated according to the target type, number of targets and applicable scene system. The tracking algorithms are classified according to the type of target, and the target tracking algorithms that are based on deep learning are classified according to the network structure. Commonly used aerial photography datasets are described, and the accuracies of commonly used target tracking methods are evaluated in an aerial photography dataset, namely, UAV123, and a long-video dataset, namely, UAV20L. Potential problems are discussed, and possible future research directions and corresponding development trends in this field are analyzed and summarized.
Due to the interference of the external environment such as rainy weather on the camera, raindrops can easily adhere to the lens and seriously affect the quality of the photos taken. Therefore, it is of great significance to remove raindrops from the image and improve the quality of the photo. In this paper, a raindrop method for generative adversarial network images based on differential learning is proposed. The general generative network is to input images with raindrops and output clean images. The generative network in this paper does not directly output clean images, but learning the difference between images with raindrops and without raindrops, then subtract the learned difference from the image with raindrops to generate a clean image. In order to learn this difference more effectively, adding reconstruction loss to the generative network, the pre-trained VGG-16 network is used to extract the difference between the generated image and the real image features and calculate the mean square error. The experimental results show, the method in this paper can not only remove the raindrops in the image well, but also reconstruct the image information of the part blocked by the raindrops. The image processed by the algorithm in this paper is tested using the yolov3 target detection algorithm, which can significantly improve the recognition accuracy of the detection algorithm.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.