2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
DOI: 10.1109/cvpr52688.2022.01393
|View full text |Cite
|
Sign up to set email alerts
|

H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-domain Weakly Supervised Object Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 20 publications
(8 citation statements)
references
References 53 publications
0
8
0
Order By: Relevance
“…A frequency division operation can extract feature maps at different frequencies to achieve the goal of preserving detail and compressing noise. Recently, data-driven approaches based on generative adversarial networks (GANs) or convolution neural networks (CNNs) have shown strong feature representation capability, which was widely applied in image enhancement, image super-resolution, object recognition, and so on [ 42 , 43 , 44 , 45 , 63 ]. Unfortunately, although these LLIE methods significantly promote contrast, saturation, and brightness, remove the color deviation, and highlight the structural details, they heavily depend on computer resources owing to the depth or width of the network.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…A frequency division operation can extract feature maps at different frequencies to achieve the goal of preserving detail and compressing noise. Recently, data-driven approaches based on generative adversarial networks (GANs) or convolution neural networks (CNNs) have shown strong feature representation capability, which was widely applied in image enhancement, image super-resolution, object recognition, and so on [ 42 , 43 , 44 , 45 , 63 ]. Unfortunately, although these LLIE methods significantly promote contrast, saturation, and brightness, remove the color deviation, and highlight the structural details, they heavily depend on computer resources owing to the depth or width of the network.…”
Section: Methodsmentioning
confidence: 99%
“…Multiscale learning structure: Generally, the image exhibits different characteristics at various scales, and a multiscale representation can effectively extract its information at different scales and promote the performance of learning-based methods [ 15 , 56 ]. As a result, the multiscale learning strategy has broadly been conducted on object identification, pose recognition, face detection, and other computer vision tasks [ 42 , 43 , 44 , 45 ]. However, this strategy is rarely considered in most state-of-the-art LLIE models.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…They focus on object classification or object detection tasks. Related methods try adapting classifiers or detectors from natural to artificial images [74,80]. However, as demonstrated in Table 5, several limitations of these datasets make them hard to bridge natural and artificial human-centric tasks.…”
Section: Datasets For Multi-scenario Generalizationmentioning
confidence: 99%
“…[69] fine-tune Faster R-CNN on People-Art to detect humans in artworks. H2FA R-CNN [74] proposes a Holistic and Hierarchical Feature Alignment R-CNN to enforce image-level alignment for object detection. [15] use image-level domain transfer and pseudo-labels from the source domain to train object detector SSD300 [35].…”
Section: Datasets For Multi-scenario Generalizationmentioning
confidence: 99%