Elia Bonetto scite author profile

A Survey on Recent Advances in Transport Layer Protocols

Over the years, the Internet has been enriched with new available communication technologies, for both fixed and mobile networks and devices, exhibiting an impressive growth in terms of performance, with steadily increasing available data rates. The Internet research community has kept trying to evolve the transport layer protocols to match the capabilities of modern networks, in order to fully reap the benefits of the new communication technologies. This paper surveys the main novelties related to transport protocols that have been recently proposed, identifying three main research trends: (i) the evolution of congestion control algorithms, to target optimal performance in challenging scenarios, possibly with the application of machine learning techniques; (ii) the proposal of brand new transport protocols, alternative to the Transmission Control Protocol (TCP) and implemented in the user-space; and (iii) the introduction of multipath capabilities at the transport layer.

AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape Estimation

Saini, Bonetto, Price, et al. 2022. IEEE Robot. Autom. Lett.

In this letter, we present a novel markerless 3D human motion capture (MoCap) system for unstructured, outdoor environments that uses a team of autonomous unmanned aerial vehicles (UAVs) with on-board RGB cameras and computation. Existing methods are limited by calibrated cameras and offline processing. Thus, we present the first method (AirPose) to estimate human pose and shape using images captured by multiple extrinsically uncalibrated flying cameras. AirPose itself calibrates the cameras relative to the person instead of relying on any pre-calibration. It uses distributed neural networks running on each UAV that communicate viewpoint-independent information with each other about the person (i.e., their 3D shape and articulated pose). The person's shape and pose are parameterized using the SMPL-X body model, resulting in a compact representation that minimizes communication between the UAVs. The network is trained using synthetic images of realistic virtual environments, and fine-tuned on a small set of real images. We also introduce an optimization-based post-processing method (AirPose+) for offline applications that require higher MoCap quality. We make our method's code and data available for research at https://github.com/robot-perception-group/AirPose. A video describing the approach and results is available at https://youtu.be/xLYe1TNHsfs.

AirCapRL: Autonomous Aerial Human Motion Capture Using Deep Reinforcement Learning

Tallamraju, Saini, Bonetto, et al. 2020. IEEE Robot. Autom. Lett.

In this letter, we introduce a deep reinforcement learning (DRL) based multi-robot formation controller for the task of autonomous aerial human motion capture (MoCap). We focus on vision-based MoCap, where the objective is to estimate the trajectory of body pose and shape of a single moving person using multiple micro aerial vehicles. State-of-the-art solutions to this problem are based on classical control methods, which depend on hand-crafted system and observation models. Such models are difficult to derive and generalize across different systems. Moreover, the non-linearities and non-convexities of these models lead to sub-optimal controls. In our work, we formulate this problem as a sequential decision-making task to achieve the vision-based motion capture objectives, and solve it using a deep neural network-based RL method. We leverage proximal policy optimization (PPO) to train a stochastic decentralized control policy for formation control. The neural network is trained in a parallelized setup in synthetic environments. We performed extensive simulation experiments to validate our approach. Finally, real-robot experiments demonstrate that our policies generalize to real-world conditions.

iRotate: Active Visual SLAM for Omnidirectional Robots

Bonetto, Goldschmid, Pabst, et al. 2022. Robotics and Autonomous Systems.

GRADE: Generating Realistic Animated Dynamic Environments for Robotics Research

Bonetto, Xu, Ahmad. 2023. Preprint.

Simulation engines like Gazebo, Unity and Webots are widely adopted in robotics. However, they lack either full simulation control, ROS integration, realistic physics, or photorealism. Recently, synthetic data generation and realistic rendering advanced tasks like target tracking and human pose estimation. However, when focusing on vision applications, there is usually a lack of information like sensor measurements (e.g. IMU, LiDAR, joint state), or time continuity. On the other hand, simulations for most robotics applications are obtained in (semi)static environments, with specific sensor settings and low visual fidelity. In this work, we present a solution to these issues with a fully customizable framework for generating realistic animated dynamic environments (GRADE) for robotics research. The data produced can be post-processed, e.g. to add noise, and easily expanded with new information using the tools that we provide. To demonstrate GRADE, we use it to generate an indoor dynamic environment dataset and then compare different SLAM algorithms on the produced sequences. By doing that, we show how current research over-relies on well-known benchmarks and fails to generalize. Furthermore, our tests with YOLO and Mask R-CNN provide evidence that our data can improve training performance and generalize to real sequences. Finally, we show GRADE's flexibility by using it for indoor active SLAM, with diverse environment sources, and in a multi-robot scenario. In doing that, we employ different control, asset placement, and simulation techniques. The code, results, implementation details, and generated data are provided as open-source. The main project page is https://eliabntt.github.io/grade-rr while the accompanying video can be found at https://youtu.be/cmywCSD-9TU.

AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning

Tallamraju, Saini, Bonetto, et al. 2020. Preprint.

Active Visual SLAM with Independently Rotating Camera

Bonetto, Goldschmid, Black, et al. 2021.

Enhancing Deep Semantic Segmentation of RGB-D Data with Entangled Forests

Terreran, Bonetto, Ghidoni. 2021.

A Survey on Recent Advances in Transport Layer Protocols
