Interspeech 2016
DOI: 10.21437/interspeech.2016-872

Expressive Singing Synthesis Based on Unit Selection for the Singing Synthesis Challenge 2016

Abstract: Sample-based and statistically based singing synthesizers typically require a large amount of data to automatically generate expressive synthetic performances. In this paper we present a singing synthesizer that, using two rather small databases, is able to generate expressive synthesis from an input consisting of notes and lyrics. The system is based on unit selection and uses the Wide-Band Harmonic Sinusoidal Model for transforming samples. The first database focuses on expression and consists of less than 2 minu…

Cited by 35 publications (33 citation statements)
References: 9 publications
“…Additionally, phonetics can also cause inflections in F0, so-called microprosody [16]. Some approaches try to decompose these factors to various degrees, for instance by separating vibratos [4] or using source material without consonants [1,17]. In our approach, however, we model the F0 contour as-is, without any decomposition.…”
Section: Pitch Model (mentioning)
confidence: 99%
“…The more prominent of which are based on concatenative methods [1,2] and statistical parametric methods centered around Hidden Markov Models (HMMs) [3,4]. Similar to in this work, an important benefit of statistical models is that they allow joint modeling of timbre and musical expression from natural singing [18,35].…”
Section: Related Work (mentioning)
confidence: 99%
“…into songs, which differs from traditional text-to-speech (TTS) synthesis. Some song synthesizers have been developed based on the unit selection speech synthesis approach [1,2]. Although this approach can achieve high sound quality, it relies on large corpora and its flexibility is limited.…”
Section: Introduction (mentioning)
confidence: 99%