2017
DOI: 10.1007/978-3-319-61657-5_3
|View full text |Cite
|
Sign up to set email alerts
|

CMS-RCNN: Contextual Multi-Scale Region-Based CNN for Unconstrained Face Detection

Abstract: Robust face detection in the wild is one of the ultimate components to support various facial related problems, i.e. unconstrained face recognition, facial periocular recognition, facial landmarking and pose estimation, facial expression recognition, 3D facial model construction, etc. Although the face detection problem has been intensely studied for decades with various commercial applications, it still meets problems in some real-world scenarios due to numerous challenges, e.g. heavy facial occlusions, extre… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
199
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 213 publications
(204 citation statements)
references
References 37 publications
1
199
0
Order By: Relevance
“…In order to validate the effectiveness of the proposed method, Faster RCNN [15,38] and CMS-RCNN [29] are applied to Sentinel-1 dataset. CMS-RCNN, which has the same resolution as conv5, fuses conv3, con4 and conv5 by down-sampling.…”
Section: Comparisons With Other Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…In order to validate the effectiveness of the proposed method, Faster RCNN [15,38] and CMS-RCNN [29] are applied to Sentinel-1 dataset. CMS-RCNN, which has the same resolution as conv5, fuses conv3, con4 and conv5 by down-sampling.…”
Section: Comparisons With Other Methodsmentioning
confidence: 99%
“…Since such an operation simply reverses the forward and backward Due to the respective merits that different layers possess, multiple layers fusion is a popular way to enhance the performance of detection in the current top-performance detector. As CMS-RCNN [29] did, the first way is to integrate down-sampled earlier layers with the last layer of the sharing CNN. Despite the fact that the feature map information is increased, small-sized objects still only cause responses on a tiny area in a fused feature map.…”
Section: Layer Up-sampling With Deconvolutionmentioning
confidence: 99%
See 2 more Smart Citations
“…Zhang et al[194] proposed FDNet based on ResNet with larger deformable convolutional kernels to capture image context. Zhu et al[195] proposed a Contextual Multi-Scale Region-based Convolution Neural Network (CMS-RCNN) in which multi-scale information was grouped both in region proposal and ROI detection to deal with faces at various range of scale. In addition, contextual information around faces is also considered in training detectors.…”
mentioning
confidence: 99%