Willi Menapace scite author profile

Deep learning (DL) has proved successful in medical imaging and, in the wake of the recent COVID-19 pandemic, some works have started to investigate DLbased solutions for the assisted diagnosis of lung diseases. While existing works focus on CT scans, this paper studies the application of DL techniques for the analysis of lung ultrasonography (LUS) images. Specifically, we present a novel fully-annotated dataset of LUS images collected from several Italian hospitals, with labels indicating the degree of disease severity at a frame-level, videolevel, and pixel-level (segmentation masks). Leveraging these data, we introduce several deep models that address relevant tasks for the automatic analysis of LUS images. In particular, we present a novel deep network, derived from Spatial Transformer Networks, which simultaneously predicts the disease severity score associated to a input frame and provides localization of pathological artefacts in a weakly-supervised way. Furthermore, we introduce a new method based on uninorms for effective frame score aggregation at a video-level. Finally, we benchmark state of the art deep models for estimating pixel-level segmentations of COVID-19 imaging biomarkers. Experiments on the proposed dataset demonstrate satisfactory results on all the considered tasks, paving the way to future research on DL for the assisted diagnosis of COVID-19 from LUS data.

show abstract

Playable Video Generation

Menapace

Lathuilière

Tulyakov

et al. 2021

View full text Add to dashboard Cite

Playable Environments: Video Manipulation in Space and Time

Menapace

Lathuilière

Siarohin

et al. 2022

View full text Add to dashboard Cite

Learning to Cluster Under Domain Shift

Menapace

Lathuilière

Ricci

2020

View full text Add to dashboard Cite

Quantum Motion Segmentation

Arrigoni

Menapace

Seelbach

et al. 2022

View full text Add to dashboard Cite

Learning to Cluster under Domain Shift

Menapace¹,

Lathuilière²,

Ricci³

2020

Preprint

View full text Add to dashboard Cite

While unsupervised domain adaptation methods based on deep architectures have achieved remarkable success in many computer vision tasks, they rely on a strong assumption, i.e. labeled source data must be available. In this work we overcome this assumption and we address the problem of transferring knowledge from a source to a target domain when both source and target data have no annotations. Inspired by recent works on deep clustering, our approach leverages information from data gathered from multiple source domains to build a domain-agnostic clustering model which is then refined at inference time when target data become available. Specifically, at training time we propose to optimize a novel information-theoretic loss which, coupled with domain-alignment layers, ensures that our model learns to correctly discover semantic labels while discarding domain-specific features. Importantly, our architecture design ensures that at inference time the resulting source model can be effectively adapted to the target domain without having access to source data, thanks to feature alignment and self-supervision. We evaluate the proposed approach in a variety of settings * , considering several domain adaptation benchmarks and we show that our method is able to automatically discover relevant semantic information even in presence of few target samples and yields state-of-the-art results on multiple domain adaptation benchmarks.

show abstract

Plotting Behind the Scenes: Towards Learnable Game Engines

Menapace¹,

Siarohin²,

Lathuilière³

et al. 2023

Preprint

View full text Add to dashboard Cite

Playable Video Generation

Menapace¹,

Lathuilière²,

Tulyakov³

et al. 2021

Preprint

View full text Add to dashboard Cite

This paper introduces the unsupervised learning problem of playable video generation (PVG). In PVG, we aim at allowing a user to control the generated video by selecting a discrete action at every time step as when playing a video game. The difficulty of the task lies both in learning semantically consistent actions and in generating realistic videos conditioned on the user input. We propose a novel framework for PVG that is trained in a self-supervised manner on a large dataset of unlabelled videos. We employ an encoder-decoder architecture where the predicted action labels act as bottleneck. The network is constrained to learn a rich action space using, as main driving loss, a reconstruction loss on the generated video. We demonstrate the effectiveness of the proposed approach on several datasets with wide environment variety. Further details, code and examples are available on our project page willimenapace.github.io/playable-video-generation-website.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Willi Menapace

Deep Learning for Classification and Localization of COVID-19 Markers in Point-of-Care Lung Ultrasound

Playable Video Generation

Playable Environments: Video Manipulation in Space and Time

Learning to Cluster Under Domain Shift

Quantum Motion Segmentation

Learning to Cluster under Domain Shift

Plotting Behind the Scenes: Towards Learnable Game Engines

Playable Video Generation

Contact Info

Product

Resources

About