Amanda Duarte scite author profile

Amanda Duarte

4 Publications

130 Citation Statements Received

65 Citation Statements Given


Affiliations

Barcelona Supercomputing Center, Universidade Federal do Rio Grande, Universitat Politècnica de Catalunya

Publications


A dataset to evaluate underwater image restoration methods

Duarte, Codevilla, Gaya, et al. 2016


Image restoration methods have been made to repair images that have some kind of degradation. Most of these methods are designed to deal with the degradation caused by over-land effects. However, when images are captured in underwater environments, there are different properties that can degrade the image in unusual ways. This work aims to evaluate how popular image restoration methods behave when applied to underwater images in the presence of water turbidity. For this, we propose a dataset in which we are able to control the amount of image degradation due to underwater properties in a scenario with 3D objects that represent seabed characteristics. We then evaluate the restoration produced by these methods and their behavior across the image degradation due to turbidity.


Wav2Pix: Speech-conditioned Face Generation Using Generative Adversarial Networks

Duarte, Roldan, Tubau, et al. 2019


Speech is a rich biometric signal that contains information about the identity, gender and emotional state of the speaker. In this work, we explore its potential to generate face images of a speaker by conditioning a Generative Adversarial Network (GAN) with raw speech input. We propose a deep neural network that is trained from scratch in an end-to-end fashion, generating a face directly from the raw speech waveform without any additional identity information (e.g., a reference image or one-hot encoding). Our model is trained in a self-supervised approach by exploiting the audio and visual signals naturally aligned in videos. With the purpose of training from video data, we present a novel dataset collected for this work, with high-quality videos of YouTubers with notable expressiveness in both the speech and visual signals.


Cross-modal Embeddings for Video and Audio Retrieval

Surís, Duarte, Salvador, et al. 2019


The increasing amount of online videos brings several opportunities for training self-supervised neural networks. The creation of large-scale video datasets such as YouTube-8M allows us to deal with this large amount of data in a manageable way. In this work, we find new ways of exploiting this dataset by taking advantage of the multi-modal information it provides. By means of a neural network, we are able to create links between audio and visual documents by projecting them into a common region of the feature space, obtaining joint audio-visual embeddings. These links are used to retrieve audio samples that fit well to a given silent video, and also to retrieve images that match a given query audio. The results in terms of Recall@K obtained over a subset of YouTube-8M videos show the potential of this unsupervised approach for cross-modal feature learning. We train embeddings for both scales and assess their quality in a retrieval problem, formulated as using the feature extracted from one modality to retrieve the most similar videos based on the features computed in the other modality.
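As a rough illustration of the Recall@K metric mentioned in the abstract above, the sketch below ranks items of one modality by cosine similarity to a query from the other modality and counts how often the paired item lands in the top K. This is not the authors' code: the function names (`cosine`, `recall_at_k`), the choice of cosine similarity, and the toy two-dimensional vectors are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def recall_at_k(queries, gallery, k):
    """Fraction of queries whose paired item (same index) ranks in the top k."""
    hits = 0
    for i, q in enumerate(queries):
        # Rank gallery indices by similarity to the query, most similar first.
        ranked = sorted(
            range(len(gallery)),
            key=lambda j: cosine(q, gallery[j]),
            reverse=True,
        )
        if i in ranked[:k]:
            hits += 1
    return hits / len(queries)


# Hypothetical toy embeddings: audio[i] is assumed to be paired with video[i].
audio = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
video = [[0.9, 0.1], [0.6, 0.8], [0.2, 0.98]]

for k in (1, 2, 3):
    print(f"Recall@{k}:", recall_at_k(audio, video, k))
```

Recall@K can only grow as K grows, which is why retrieval results are usually reported at several values of K.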


How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Duarte, Palaskar, Ventura, et al. 2021


