2021
DOI: 10.1016/j.patcog.2021.107929
|View full text |Cite
|
Sign up to set email alerts
|

STDnet-ST: Spatio-temporal ConvNet for small object detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
36
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 49 publications
(36 citation statements)
references
References 9 publications
0
36
0
Order By: Relevance
“…However, these methods are difficult to cover the context information for video. Although there are some methods to integrate Spatio-temporal information, e.g., Spatio-temporal neural network built on STDnet (STDnet-ST) [121], the problems of missed and false inspection still persist.…”
Section: Object Detection From Uav-borne Videomentioning
confidence: 99%
See 2 more Smart Citations
“…However, these methods are difficult to cover the context information for video. Although there are some methods to integrate Spatio-temporal information, e.g., Spatio-temporal neural network built on STDnet (STDnet-ST) [121], the problems of missed and false inspection still persist.…”
Section: Object Detection From Uav-borne Videomentioning
confidence: 99%
“…Appl. 2019 -Zhang et al [118] Appearance deterioration, occlusion, motion blur VisDrone-VID MIPR 2020 -MOR-UAVNet [119] Moving object MOR-UAV MM 2020 https://visionintelligence.github.io/Datasets.html TDFA [120] Small-scale Okutama, VisDrone-VID Multidim Syst Sign P 2021 -STDnet-ST [121] Small object USC-GRAD-STDdb,UAVDT,VisDrone-VID PR 2021 - [118] and [120] used the effective CNN model for optical flow (PWC-Net) [132] method and spatial pyramid network (SPyNet) [133] to obtain the motion information of two neighbor frames, respectively. Zhu et al [134] designed fusion feature maps to achieve VID using deep feature flow (DFF) by learning the feature maps of key frames using feature extracting and of non-key frames using FlowNet.…”
Section: A Optical Flow-based Networkmentioning
confidence: 99%
See 1 more Smart Citation
“…However, it is noteworthy that the existing networks, such as AlexNet, RCNN, and Fast-RCNN, suffer from non-negligible miss-detections and low recall for the small spots. For instance, if the spot pixels are <32 32 (Bosquet et al, 2018 ), or when the image resolution is not high. According to the definition of the international organization SPIE, a small target is a target area <80 pixels in a 256 × 256 image, that is, the target whose pixel proportion is <0.12% of the total image pixels.…”
Section: Introductionmentioning
confidence: 99%
“…Wang Hongfeng et al [19] proposed a generative adversarial network (GAN) capable of image super-resolution and two-stage small object detection, which exhibited a better detection performance than mainstream methods. Bosquet Brais et al [20] introduced STDnet-ST, an end-to-end spatiotemporal convolutional neural network for small object detection in video, which achieved state-of-the-art results for small objects. Lian Jing et al [21] proposed a small object detection method in traffic scenes based on attention feature fusion, which improved the detection accuracy of small objects in traffic scenes.…”
Section: Introductionmentioning
confidence: 99%