Ankit Laddha scite author profile

The capability to detect objects is a core part of autonomous driving. Due to sensor noise and incomplete data, perfectly detecting and localizing every object is infeasible. Therefore, it is important for a detector to provide the amount of uncertainty in each prediction. Providing the autonomous system with reliable uncertainties enables the vehicle to react differently based on the level of uncertainty. Previous work has estimated the uncertainty in a detection by predicting a probability distribution over object bounding boxes. In this work, we propose a method to improve the ability to learn the probability distribution by considering the potential noise in the ground-truth labeled data. Our proposed approach improves not only the accuracy of the learned distribution but also the object detection performance.

Sensor Fusion for Joint 3D Object Detection and Semantic Segmentation

Meyer, Charland, Hegde et al. 2019

In this paper, we present an extension to LaserNet, an efficient and state-of-the-art LiDAR based 3D object detector. We propose a method for fusing image data with the LiDAR data and show that this sensor fusion method improves the detection performance of the model especially at long ranges. The addition of image data is straightforward and does not require image labels. Furthermore, we expand the capabilities of the model to perform 3D semantic segmentation in addition to 3D object detection. On a large benchmark dataset, we demonstrate our approach achieves state-of-the-art performance on both object detection and semantic segmentation while maintaining a low runtime.

Map-supervised road detection

Laddha, Kocamaz, Navarro-Serment et al. 2016

MVFuseNet: Improving End-to-End Object Detection and Motion Forecasting through Multi-View Fusion of LiDAR Data

Laddha, Gautam, Palombo et al. 2021

Resolving vision and language ambiguities together: Joint segmentation & prepositional attachment resolution in captioned scenes

Christie, Laddha, Agrawal et al. 2017

Computer Vision and Image Understanding

LaserFlow: Efficient and Probabilistic Object Detection and Motion Forecasting

Meyer, Charland, Pandey et al. 2021

IEEE Robot. Autom. Lett.

Resolving Language and Vision Ambiguities Together: Joint Segmentation and Prepositional Attachment Resolution in Captioned Scenes

Christie, Laddha, Agrawal et al. 2016

We present an approach to simultaneously perform semantic segmentation and prepositional phrase attachment resolution for captioned images. Some ambiguities in language cannot be resolved without simultaneously reasoning about an associated image. If we consider the sentence "I shot an elephant in my pajamas", looking at language alone (and not using common sense), it is unclear if it is the person or the elephant wearing the pajamas or both. Our approach produces a diverse set of plausible hypotheses for both semantic segmentation and prepositional phrase attachment resolution that are then jointly reranked to select the most consistent pair. We show that our semantic segmentation and prepositional phrase attachment resolution modules have complementary strengths, and that joint reasoning produces more accurate results than any module operating in isolation. Multiple hypotheses are also shown to be crucial to improved multiple-module reasoning. Our vision and language approach significantly outperforms the Stanford Parser (De Marneffe et al., 2006) by 17.91% (28.69% relative) and 12.83% (25.28% relative) in two different experiments. We also make small improvements over DeepLab-CRF (Chen et al., 2015).

RV-FuseNet: Range View Based Fusion of Time-Series LiDAR Data for Joint 3D Object Detection and Motion Forecasting

Laddha, Gautam, Meyer et al. 2021


Ankit Laddha

LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving
