2017
DOI: 10.1109/tpami.2016.2577031
|View full text |Cite
|
Sign up to set email alerts
|

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Abstract: State-of-the-art object detection networks depend on region proposal algorithms to hypothesize object locations. Advances like SPPnet [1] and Fast R-CNN [2] have reduced the running time of these detection networks, exposing region proposal computation as a bottleneck. In this work, we introduce a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals. An RPN is a fully convolutional network that simultaneously pre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

51
28,207
2
189

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 31,341 publications
(28,247 citation statements)
references
References 41 publications
51
28,207
2
189
Order By: Relevance
“…It is trained to predict which of the two positions results in a win. Note that similar to the most successful object detection methods [11], we found a 2-value output to outperform one binary output. Figure 1 illustrates the neural network architecture.…”
Section: Learning To Compare Positionssupporting
confidence: 75%
“…It is trained to predict which of the two positions results in a win. Note that similar to the most successful object detection methods [11], we found a 2-value output to outperform one binary output. Figure 1 illustrates the neural network architecture.…”
Section: Learning To Compare Positionssupporting
confidence: 75%
“…achieving high object recall with less number of bounding boxes, preferably with a small computational overhead and the potential to scale to hundreds of object categories [35,40,37]. Here, discriminative methods based on deep learning models have helped improve the ranking quality of proposal approaches [7,37,28,32]. Inspired by this work, we extend the use of deep and recurrent networks to temporal action proposal generation by introducing a new architecture.…”
Section: Related Workmentioning
confidence: 99%
“…of efficiency and high detection rates [29,28]. Efficient object proposal modules have also enabled a boost in performance of other high-level visual tasks, such as simultaneous detection and segmentation, object tracking, and image captioning [13,14,16,20].…”
Section: Introductionmentioning
confidence: 99%
“…In vision, abstractions can include object detection (Girshick et al, 2014;Girshick, 2015;Ren et al, 2015), classification (Krizhevsky et al, 2012;Simonyan and Zisserman, 2014;Szegedy et al, 2015), and semantic understanding (Huang et al, 2013) using convolution neural networks (LeCun and Bengio, 1995). Inspired by the hierarchical architecture of the human visual cortex (Hubel and Wiesel, 1962), architectures for multiple convolution-pooling layers have been proposed and are being used in different machine learning tasks.…”
Section: Trends In the Development Of Ai Technology Applications Formentioning
confidence: 99%