2019 IEEE Winter Conference on Applications of Computer Vision (WACV) 2019
DOI: 10.1109/wacv.2019.00210
|View full text |Cite
|
Sign up to set email alerts
|

Domain Randomization for Scene-Specific Car Detection and Pose Estimation

Abstract: We address the issue of domain gap when making use of synthetic data to train a scene-specific object detector and pose estimator. While previous works have shown that the constraints of learning a scene-specific model can be leveraged to create geometrically and photometrically consistent synthetic data, care must be taken to design synthetic content which is as close as possible to the real-world data distribution. In this work, we propose to solve domain gap through the use of appearance randomization to ge… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 36 publications
(19 citation statements)
references
References 36 publications
0
19
0
Order By: Relevance
“…As shown in Fig. 3, we categorize them into three groups, namely: Domain randomization [24,25,26,27,28,29,30] Adversarial data augmentation [31,32,33] Data generation [34,35,36,37,38,39,30,40,41,42,43,44,45,46,157] Representation learning…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…As shown in Fig. 3, we categorize them into three groups, namely: Domain randomization [24,25,26,27,28,29,30] Adversarial data augmentation [31,32,33] Data generation [34,35,36,37,38,39,30,40,41,42,43,44,45,46,157] Representation learning…”
Section: Methodsmentioning
confidence: 99%
“…Tobin et al [25] first used this method to generate more training data from the simulated environment for generalization in the real environment. Similar techniques were also used in [26,27,28,24] to strengthen the generalization capability of the models. Prakash et al [29] further took into account the structure of the scene when randomly placing objects for data generation, which enables the neural network to learn to utilize context when detecting objects.…”
Section: Data Augmentation-based Dgmentioning
confidence: 99%
“…They rely on model data during inference, from which we refrain from for the reasons stated in section 2.4. The works of Ren [67], Khirodkar [68] and Tremblay [69] focus on detection and 3-dof estimation. Yet they offer a great contribution showing the potential of DR in overcoming the reality gap, increasing overall accuracy.…”
Section: Bridging the Reality Gapmentioning
confidence: 99%
“…Once trained, Task2Sim can be used not only for "seen" tasks but also can be used in one-shot to generate simulation parameters for novel "unseen" tasks. similar to ours, is domain randomization [2,26,46,61,77], which learns pre-trained models from datasets generated by randomly varying simulator parameters. In contrast, Task2Sim learns simulator parameters to generate synthetic datasets that maximize transfer learning performance.…”
Section: Related Workmentioning
confidence: 99%
“…Our extensive experiments using 20 downstream classification datasets show that on seen tasks, given a number of images per category, Task2Sim's output parameters generate pre-training datasets that are much better for downstream performance than approaches like domain randomization [2,26,77] that are not task-adaptive. Moreover, we show Task2Sim also generalizes well to unseen tasks, maintaining an edge over non-adaptive approaches while being competitive with Imagenet pre-training.…”
Section: Introductionmentioning
confidence: 99%