2020
DOI: 10.48550/arxiv.2005.13243
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Poly-YOLO: higher speed, more precise detection and instance segmentation for YOLOv3

Abstract: We present a new version of YOLO with better performance and extended with instance segmentation called Poly-YOLO. Poly-YOLO builds on the original ideas of YOLOv3 and removes two of its weaknesses: a large amount of rewritten labels and inefficient distribution of anchors. Poly-YOLO reduces the issues by aggregating features from a light SE-Darknet-53 backbone with a hypercolumn technique, using stairstep upsampling, and produces a single scale output with high resolution. In comparison with YOLOv3, Poly-YOLO… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 31 publications
(52 reference statements)
0
10
0
Order By: Relevance
“…3) Comparison with Single-stage object detection and Others: Poly YOLO [16] reported ∼ 22 fps on a 416×832 image with an AP score of 8.7. Other methods like Deep Watershed [48] and SGN [49] incur a huge computation complexity in their instance assignment techniques.…”
Section: State Of the Art Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…3) Comparison with Single-stage object detection and Others: Poly YOLO [16] reported ∼ 22 fps on a 416×832 image with an AP score of 8.7. Other methods like Deep Watershed [48] and SGN [49] incur a huge computation complexity in their instance assignment techniques.…”
Section: State Of the Art Comparisonmentioning
confidence: 99%
“…Recently, single stage instance segmentation methods have been developed [15], [16]. These major approaches use fully convolution networks so that they can be trained in an endto-end fashion.…”
Section: Introductionmentioning
confidence: 99%
“…A polygon is a generic representation for any arbitrary shape; however, it is more expensive to annotate than a bounding box. The object contour can be uniformly sampled in the range of 360°split into N equal polygon vertices, each represented by the radial distance r from the object's centroid as used in PolyYOLO [16]. We observe that uniform sampling cannot efficiently represent high curvature variations in the fisheye image object contours.…”
Section: B Generalized Object Detectionmentioning
confidence: 99%
“…Subsequently, BshapeNet [26] designs extended frameworks by adding a bounding box mask branch providing additional information of object positions and coordinates to a Faster R-CNN to enhance the performance of instance segmentation. Recently, a few works propose some novel contourbased approaches for real-time instance segmentation [27], [28]. Deep Snake [27] uses the circular convolution for feature learning on the contour and proposes a two-stage pipeline including initial contour proposal and contour deformation for instance segmentation.…”
Section: B Instance Segmentationmentioning
confidence: 99%
“…Deep Snake [27] uses the circular convolution for feature learning on the contour and proposes a two-stage pipeline including initial contour proposal and contour deformation for instance segmentation. POLY-YOLO [28] increases the detection accuracy of YOLOv3 and realizes instance segmentation using tight polygon-based contour. However, most of these methods focus on appearance cues and can only segment objects that have been labeled as a specific category in a training set.…”
Section: B Instance Segmentationmentioning
confidence: 99%