Mosam Dabhi scite author profile

Mosam Dabhi

5Publications

40Citation Statements Received

204Citation Statements Given

How they've been cited

How they cite others

144

203

Affiliations

Carnegie Mellon University

Publications

Order By: Most citations

Real-Time Information-Theoretic Exploration with Gaussian Mixture Model Maps

Tabib

Goel

Yao

et al. 2019

View full text Add to dashboard Cite

This paper develops an exploration framework that leverages Gaussian mixture models (GMMs) for high-fidelity perceptual modeling and exploits the compactness of the distributions for information sharing in communications-constrained applications. State-of-the-art, high-resolution perceptual modeling techniques do not always consider the implications of transferring the model across limited bandwidth communications channels, which is critical for real-time information sharing. To bridge this gap in the state of the art, this paper presents a system that compactly represents sensor observations as GMMs and maintains a local occupancy grid map for a sampling-based motion planner that maximizes an information-theoretic objective function. The method is extensively evaluated in long duration simulations on an embedded PC and deployed to an aerial robot equipped with a 3D LiDAR. The result is significant memory efficiency as compared to state-of-the-art techniques.

show abstract

Fast and Agile Vision-Based Flight with Teleoperation and Collision Avoidance on a Multirotor

Spitzer

Yang

Yao

et al. 2020

View full text Add to dashboard Cite

We present a multirotor architecture capable of aggressive autonomous flight and collision-free teleoperation in unstructured, GPS-denied environments. The proposed system enables aggressive and safe autonomous flight around clutter by integrating recent advancements in visual-inertial state estimation and teleoperation. Our teleoperation framework maps user inputs onto smooth and dynamically feasible motion primitives. Collision-free trajectories are ensured by querying a locally consistent map that is incrementally constructed from forward-facing depth observations. Our system enables a non-expert operator to safely navigate a multirotor around obstacles at speeds of 10 m/s. We achieve autonomous flights at speeds exceeding 12 m/s and accelerations exceeding 12 m/s 2 in a series of outdoor field experiments that validate our approach.Authors are with the

show abstract

High Fidelity 3D Reconstructions with Limited Physical Views

Dabhi

Wang

Saluja

et al. 2021

View full text Add to dashboard Cite

High Fidelity 3D Reconstructions with Limited Physical Views

Dabhi¹,

Wang²,

Saluja³

et al. 2021

Preprint

View full text Add to dashboard Cite

Multi-view triangulation is the gold standard for 3D reconstruction from 2D correspondences given known calibration and sufficient views. However in practice, expensive multi-view setups -involving tens sometimes hundreds of cameras -are required in order to obtain the high fidelity 3D reconstructions necessary for many modern applications. In this paper we present a novel approach that leverages recent advances in 2D-3D lifting using neural shape priors while also enforcing multi-view equivariance. We show how our method can achieve comparable fidelity to expensive calibrated multi-view rigs using a limited (2-3) number of uncalibrated camera views.

show abstract

MBW: Multi-view Bootstrapping in the Wild

Dabhi¹,

Wang²,

Clifford³

et al. 2022

Preprint

View full text Add to dashboard Cite

Labeling articulated objects in unconstrained settings have a wide variety of applications including entertainment, neuroscience, psychology, ethology, and many fields of medicine. Large offline labeled datasets do not exist for all but the most common articulated object categories (e.g., humans). Hand labeling these landmarks within a video sequence is a laborious task. Learned landmark detectors can help, but can be error-prone when trained from only a few examples. Multi-camera systems that train fine-grained detectors have shown significant promise in detecting such errors, allowing for self-supervised solutions that only need a small percentage of the video sequence to be hand-labeled. The approach, however, is based on calibrated cameras and rigid geometry, making it expensive, difficult to manage, and impractical in real-world scenarios. In this paper, we address these bottlenecks by combining a non-rigid 3D neural prior with deep flow to obtain high-fidelity landmark estimates from videos with only two or three uncalibrated, handheld cameras. With just a few annotations (representing 1-2% of the frames), we are able to produce 2D results comparable to state-of-the-art fully supervised methods, along with 3D reconstructions that are impossible with other existing approaches. Our Multi-view Bootstrapping in the Wild (MBW) approach demonstrates impressive results on standard human datasets, as well as tigers, cheetahs, fish, colobus monkeys, chimpanzees, and flamingos from videos captured casually in a zoo. We release the codebase for MBW as well as this challenging zoo dataset consisting image frames of tail-end distribution categories with their corresponding 2D, 3D labels generated from minimal human intervention. * indicates the authors advised equally 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Mosam Dabhi

Real-Time Information-Theoretic Exploration with Gaussian Mixture Model Maps

Fast and Agile Vision-Based Flight with Teleoperation and Collision Avoidance on a Multirotor

High Fidelity 3D Reconstructions with Limited Physical Views

High Fidelity 3D Reconstructions with Limited Physical Views

MBW: Multi-view Bootstrapping in the Wild

Contact Info

Product

Resources

About