Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval 2017
DOI: 10.1145/3078971.3078990
|View full text |Cite
|
Sign up to set email alerts
|

Improving Small Object Proposals for Company Logo Detection

Abstract: Many modern approaches for object detection are two-staged pipelines. The first stage identifies regions of interest which are then classified in the second stage. Faster R-CNN is such an approach for object detection which combines both stages into a single pipeline. In this paper we apply Faster R-CNN to the task of company logo detection. Motivated by its weak performance on small object instances, we examine in detail both the proposal and the classification stage with respect to a wide range of object siz… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
33
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 66 publications
(38 citation statements)
references
References 18 publications
1
33
0
Order By: Relevance
“…It is essential to determine what size of the anchor box is suitable for each scale of the network. Inspired by the method of the proposal generation [33], we used mathematical derivation based on IOU to help select the appropriate size of the anchor boxes for each scale.…”
Section: Appropriate Size For Anchor Boxesmentioning
confidence: 99%
“…It is essential to determine what size of the anchor box is suitable for each scale of the network. Inspired by the method of the proposal generation [33], we used mathematical derivation based on IOU to help select the appropriate size of the anchor boxes for each scale.…”
Section: Appropriate Size For Anchor Boxesmentioning
confidence: 99%
“…This deletes most of the cobbles and fine boulder information from the data if large scale mosaics were fed directly into the network. Hence, the smaller tiles exported from the backscatter mosaics were upscaled to values between 300 and 1200 pixels, which is the simplest approach to facilitate small object detection [30,31]. The size of anchor boxes used to determine the bounding box of objects were left at their standard settings of 32, 64, 128, For classification and object detection, we use an open source RetinaNet [27] implementation in Python, available on GitHub (https://github.com/fizyr/keras-retinanet, last accessed on 6 February 2019).…”
Section: Preparation Of Train Validation and Test Datasetsmentioning
confidence: 99%
“…Therefore, it is mandatory to detect objects of the smallest possible size and to consider the minimum object size detectable by the trained models. The minimum size of objects whose detection can be trained by RetinaNet depends on a) the resolution of the input backscatter mosaic and b) the minimum anchor box of the network measured in pixels multiplied by the threshold of areal overlap of 0.5 required for a positive training [30,31]. For a minimum anchor box of 32 pixels, this results in a theoretical minimum threshold for positive training of 23 × 23 pixels.…”
Section: Constraining the Minimum Size Of Detected Bouldersmentioning
confidence: 99%
See 1 more Smart Citation
“…The AlexNet (Krizhevsky, Sutskever, & Hinton, 2012), VGG (Simonyan, & Zisserman, 2014), GoogLeNet (Szegedy et al, 2015), and ALL-CNN (Springenberg, Dosovitskiy, Brox, & Riedmiller, 2014) incorporated deep structures with variants of layers and have achieved remarkable performance in the classification of a large number of categories. In terms of image detection, the Faster region-based CNN (Faster R-CNN) (Ren, He, Girshick, & Sun, 2015) has been proved as an efficient detection method for small object, such as ship detection from SAR images (Kang, Leng, Lin, & Ji, 2017), company logo detection from real-world images (Eggert, Zecha, Brehm, & Lienhart, 2017), cancer cell detection (Zhang, Hu, Chen, Huang, & Guan, 2016) and gland instance detection (Xu et al, 2017) from microscopic images. Until now, relatively few studies were performed to identify M. tuberculosis from microscopic images using CNNs, except for chest X-ray TB image studies (Cao et al, 2016;Silva, Silva, Pinho, & Costa, 2017).…”
Section: Introductionmentioning
confidence: 99%