2018
DOI: 10.48550/arxiv.1802.00285
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Virtual-to-Real: Learning to Control in Visual Semantic Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(19 citation statements)
references
References 0 publications
0
14
0
Order By: Relevance
“…For low level vision tasks, synthetic images have been employed for stereo vision [45] and optical flow estimation [5]. For higher level tasks, computer-aided design (CAD) models have also been extensively used for object detection [27,44,34] or segmentation [18]. Synthetic human figures have been extensively used for learning purposes, such as silhouette-based action recognition tasks [48], and crowd counting [63].…”
Section: Synthetic Human Pose Datamentioning
confidence: 99%
“…For low level vision tasks, synthetic images have been employed for stereo vision [45] and optical flow estimation [5]. For higher level tasks, computer-aided design (CAD) models have also been extensively used for object detection [27,44,34] or segmentation [18]. Synthetic human figures have been extensively used for learning purposes, such as silhouette-based action recognition tasks [48], and crowd counting [63].…”
Section: Synthetic Human Pose Datamentioning
confidence: 99%
“…The performance is excellent in simulation but barely satisfying in the real world due to the lack of modeling the noise in real depth images. The navigation model based on depth image in [23] behaves poorly out of the same reason.…”
Section: Related Workmentioning
confidence: 99%
“…One implicit approach is to adopt an image segmentation network as semantic feature extraction layers and add new layers to output the control commands [24]. Another explicit approach with better performance is to generate a semantic segmentation image first and then feeds it to another network to get waypoints [9] or velocity output [23], [25], [26]. The above works behave well when the simulated training environment is elaborate and the testing scenario is not cluttered.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…3) Simulation approaches [20], [21]: This method can artificially create large datasets by changing various background images and by capturing images of the target object from multiple locations. However, the quality of the dataset is usually low for objects that are difficult to simulate, such as deformable objects.…”
Section: B Automatic Annotationmentioning
confidence: 99%