Param S. Rajpura scite author profile

The need for large annotated image datasets for training Convolutional Neural Networks (CNNs) has been a significant impediment for their adoption in computer vision applications. We show that with transfer learning an effective object detector can be trained almost entirely on synthetically rendered datasets. We apply this strategy for detecting packaged food products clustered in refrigerator scenes. Our CNN trained only with 4000 synthetic images achieves mean average precision (mAP) of 24 on a test set with 55 distinct products as objects of interest and 17 distractor objects. A further increase of 12% in the mAP is obtained by adding only 400 real images to these 4000 synthetic images in the training set. A high degree of photorealism in the synthetic images was not essential in achieving this performance. We analyze factors like training data set size and 3D model dictionary size for their influence on detection performance. Additionally, training strategies like fine-tuning with selected layers and early stopping which affect transfer learning from synthetic scenes to real scenes are explored. Training CNNs with synthetic datasets is a novel application of high-performance computing and a promising approach for object detection applications in domains where there is a dearth of large annotated image data.

show abstract

Transfer Learning by Finetuning Pretrained CNNs Entirely with Synthetic Images

Rajpura¹,

Aggarwal²,

Goyal³

et al. 2018

View full text Add to dashboard Cite

Dataset Augmentation with Synthetic Images Improves Semantic Segmentation

Goyal

Rajpura

Bojinov³

et al. 2018

View full text Add to dashboard Cite

Although Deep Convolutional Neural Networks trained with strong pixel-level annotations have significantly pushed the performance in semantic segmentation, annotation efforts required for the creation of training data remains a roadblock for further improvements. We show that augmentation of the weakly annotated training dataset with synthetic images minimizes both the annotation efforts and also the cost of capturing images with sufficient variety. Evaluation on the PASCAL 2012 validation dataset shows an increase in mean IOU from 52.80% to 55.47% by adding just 100 synthetic images per object class. Our approach is thus a promising solution to the problems of annotation and dataset collection.Authors contributed equally to this manuscript arXiv:1709.00849v3 [cs.CV]

show abstract

Autonomous Navigation Using Monocular ORB SLAM2

Vithalani

Soni

Rajpura

2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Param S. Rajpura

Transfer Learning for Object Detection using State-of-the-Art Deep Neural Networks

Object Detection Using Deep CNNs Trained on Synthetic Images

Transfer Learning by Finetuning Pretrained CNNs Entirely with Synthetic Images

Dataset Augmentation with Synthetic Images Improves Semantic Segmentation

Autonomous Navigation Using Monocular ORB SLAM2

Contact Info

Product

Resources

About