Ehtesham Hassan scite author profile

Object detection in real images is a challenging problem in computer vision. Despite several advancements in detection and recognition techniques, robust and accurate localization of interesting objects in images from real-life scenarios remains unsolved because of the difficulties posed by intraclass and interclass variations, occlusion, lightning, and scale changes at different levels. In this work, we present an object detection framework by learning-based fusion of handcrafted features with deep features. Deep features characterize different regions of interest in a testing image with a rich set of statistical features. Our hypothesis is to reinforce these features with handcrafted features by learning the optimal fusion during network training. Our detection framework is based on the recent version of YOLO object detection architecture. Experimental evaluation on PASCAL-VOC and MS-COCO datasets achieved the detection rate increase of 11.4% and 1.9% on the mAP scale in comparison with the YOLO version-3 detector (Redmon and Farhadi 2018). An important step in the proposed learning-based feature fusion strategy is to correctly identify the layer feeding in new features. The present work shows a qualitative approach to identify the best layer for fusion and design steps for feeding in the additional feature sets in convolutional network-based detectors.

show abstract

Shape Descriptor Based Document Image Indexing and Symbol Recognition

Hassan

Chaudhury

Gopal

2009

View full text Add to dashboard Cite

Robust Hand Gestural Interaction for Smartphone Based AR/VR Applications

Mohatta

Perla

Gupta

et al. 2017

View full text Add to dashboard Cite

GestAR: Real Time Gesture Interaction for AR with Egocentric View

Hegde

Perla

Hebbalaguppe

et al. 2016

View full text Add to dashboard Cite

Pedestrian Detection via Mixture of CNN Experts and Thresholded Aggregated Channel Features

Verma¹,

Hebbalaguppe²,

Vig³

et al. 2015

View full text Add to dashboard Cite

Feature Combination in Kernel Space for Distance Based Image Hashing

Hassan

Chaudhury

Gopal

2012

IEEE Trans. Multimedia

View full text Add to dashboard Cite

12 3 4 5 6

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ehtesham Hassan

Telecom Inventory Management via Object Recognition and Localisation on Google Street View Images

Robust visual analysis for planogram compliance problem

Learning Feature Fusion in Deep Learning-Based Object Detector

Shape Descriptor Based Document Image Indexing and Symbol Recognition

Robust Hand Gestural Interaction for Smartphone Based AR/VR Applications

GestAR: Real Time Gesture Interaction for AR with Egocentric View

Pedestrian Detection via Mixture of CNN Experts and Thresholded Aggregated Channel Features

Feature Combination in Kernel Space for Distance Based Image Hashing

Contact Info

Product

Resources

About