Interspeech 2022 2022
DOI: 10.21437/interspeech.2022-10603
|View full text |Cite
|
Sign up to set email alerts
|

End-to-End Binaural Speech Synthesis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…Liang et al [48] proposed an audio-visual neural field in real-world audio-visual scenes. Many recent works have proposed to generate spatial audio from mono audio using visual cues [62,28,72,90,49,100,30], or the relative pose between sound sources and the receiver [76,39]. Inspired by these works, our feature learning approach learns spatial representations through an audio prediction task.…”
Section: Related Workmentioning
confidence: 99%
“…Liang et al [48] proposed an audio-visual neural field in real-world audio-visual scenes. Many recent works have proposed to generate spatial audio from mono audio using visual cues [62,28,72,90,49,100,30], or the relative pose between sound sources and the receiver [76,39]. Inspired by these works, our feature learning approach learns spatial representations through an audio prediction task.…”
Section: Related Workmentioning
confidence: 99%