2012
DOI: 10.1016/j.csl.2011.03.003

A comparative study of glottal source estimation techniques

Abstract: Source-tract decomposition (or glottal flow estimation) is one of the basic problems of speech processing. For this, several techniques have been proposed in the literature. However, studies comparing different approaches are almost nonexistent. Besides, experiments have been systematically performed either on synthetic speech or on sustained vowels. In this study we compare three of the main representative state-of-the-art methods of glottal flow estimation: closed-phase inverse filtering, iterative and adapt…

Cited by 91 publications (50 citation statements)
References 35 publications
“…This process is called inverse filtering [Fritzell 1992, Walker and Murphy 2007, Drugman et al 2012, Gudnason et al 2012]. The vocal tract (throat, mouth and in some cases nose) forms the tube, which is characterized by its resonances.…”
Section: Feature Extraction (mentioning)
confidence: 99%
“…Nonetheless, this approach requires the reliable and accurate separation of these components from each other using glottal inverse filtering, which is a difficult inverse problem [10,11]. In the second case, the filter corresponds to the overall spectral envelope of speech and the excitation is the residual signal obtained by feeding the speech signal through the inverse of the estimated filter.…”
Section: Introduction (mentioning)
confidence: 99%
“…This is especially true for GSGW whose error rate reaches 41% in the noisiest conditions. It is indeed known [6] that the performance of glottal flow estimation techniques rapidly degrades in such environments. Although the proposed OMPD method remains the best approach up to 20dB of SNR, it is clearly outperformed in more severe environments.…”
Section: Robustness To An Additive Noise (mentioning)
confidence: 99%
“…An error on its determination results in a severe impact on the reliability and accuracy performance. There are also some methods of glottal flow estimation and for its parameterization in the time domain which assume a positive speech polarity [6]. This paper proposes a new approach for the automatic detection of speech polarity which is based on the phase shift between two oscillating signals derived from the speech waveform.…”
Section: Introduction (mentioning)
confidence: 99%