Siddharth Mahendran scite author profile

Abstract3D pose estimation is a key component of many important computer vision tasks such as autonomous navigation and 3D scene understanding. Most state-of-the-art approaches to 3D pose estimation solve this problem as a pose-classification problem in which the pose space is discretized into bins and a CNN classifier is used to predict a pose bin. We argue that the 3D pose space is continuous and propose to solve the pose estimation problem in a CNN regression framework with a suitable representation, data augmentation and loss function that captures the geometry of the pose space. Experiments on PASCAL3D+ show that the proposed 3D pose regression approach achieves competitive performance compared to the state-of-the-art.

show abstract

Understanding Objects in Detail with Fine-Grained Attributes

Vedaldi

Mahendran

Tsogkas

et al. 2014

View full text Add to dashboard Cite

We study the problem of understanding objects in detail, intended as recognizing a wide array of fine-grained object attributes. To this end, we introduce a dataset of 7,413 airplanes annotated in detail with parts and their attributes, leveraging images donated by airplane spotters and crowdsourcing both the design and collection of the detailed annotations. We provide a number of insights that should help researchers interested in designing fine-grained datasets for other basic level categories. We show that the collected data can be used to study the relation between part detection and attribute prediction by diagnosing the performance of classifiers that pool information from different parts of an object. We note that the prediction of certain attributes can benefit substantially from accurate part detection. We also show that, differently from previous results in object detection, employing a large number of part templates can improve detection accuracy at the expenses of detection speed. We finally propose a coarse-to-fine approach to speed up detection through a hierarchical cascade algorithm. 1 We already introduced a superset of these aircraft images for FGcomp 2013 [28], but without detailed annotations.

show abstract

3D Pose Regression Using Convolutional Neural Networks

Mahendran

Ali

Vidal

2017

View full text Add to dashboard Cite

show abstract

Ensemble Systems for Automatic Fracture Detection

Mahendran¹,

Baboo²

2012

IJET

View full text Add to dashboard Cite

Fracture detection based on image classification is an area of research which has proved to be challenging for the past several decades. This field has gained more attention due to the new challenges posed by voluminous image databases. In this research work, fusion-based classifiers are constructed, which extracts features from the images, use these features to train and test the classifiers for the purpose of detecting fractures in X-Ray images. The various features extracted are Contrast, Homogeneity, Energy, Entropy, Mean, Variance, Standard Deviation, Correlation, Gabor orientation (GO), Markov Random Field (MRF), and intensity gradient direction (IGD). Three classifiers, BPNN, SVM and NB classifiers are used. Using these features and classifiers, three single classifiers and four multiple classifiers were developed. All the classifiers were tested vigorously with the test dataset for evaluating the winner combination of classifiers and features that correctly identifies fractures in a bone image. The performance metrics used are sensitivity, specificity, positive predictive value, negative predictive value, accuracy and execution time. The experimental results showed that usage of fusion classifiers enhances the detection capacity and the combination SVM and BPNN produces the best result.

show abstract

Convolutional Networks for Object Category and 3D Pose Estimation from 2D Images

Mahendran

Ali

Vidal

2019

View full text Add to dashboard Cite

Current CNN-based algorithms for recovering the 3D pose of an object in an image assume knowledge about both the object category and its 2D localization in the image. In this paper, we relax one of these constraints and propose to solve the task of joint object category and 3D pose estimation from an image assuming known 2D localization. We design a new architecture for this task composed of a feature network that is shared between subtasks, an object categorization network built on top of the feature network, and a collection of category dependent pose regression networks. We also introduce suitable loss functions and a training method for the new architecture. Experiments on the challenging PASCAL3D+ dataset show state-of-the-art performance in the joint categorization and pose estimation task. Moreover, our performance on the joint task is comparable to the performance of state-of-the-art methods on the simpler 3D pose estimation with known object category task.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.