2002
DOI: 10.1250/ast.23.69

Enhancement of esophageal speech using formant synthesis.

Abstract: The feasibility of using the formant analysis-synthesis approach to replace the voicing sources of esophageal speech was explored. Inverse-filtered signals extracted from normal speakers provided the voicing sources. Various pitch extraction methods were tested, and a computationally simple, band-limited autocorrelation method was chosen. To accomplish stable and practical speech enhancement, the input signal was divided into low- and high-frequency channels, then only the low-fr…
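
As a rough illustration of what a band-limited autocorrelation pitch estimator can look like, the sketch below band-limits a frame and picks the strongest autocorrelation peak within a plausible lag range. It is not the paper's implementation: the FFT-mask band-pass, the 50-400 Hz band, and the names `bandlimit` and `estimate_pitch` are all assumptions made for this example.

```python
import numpy as np


def bandlimit(x, fs, f_lo, f_hi):
    """Brick-wall band-pass via FFT masking (an illustrative choice, not the paper's filter)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    spectrum[(freqs < f_lo) | (freqs > f_hi)] = 0.0
    return np.fft.irfft(spectrum, n=len(x))


def estimate_pitch(frame, fs, f_min=50.0, f_max=400.0):
    """Return an F0 estimate in Hz for one frame, or 0.0 if the frame has no energy.

    f_min and f_max are assumed search limits, not values from the paper.
    """
    x = bandlimit(np.asarray(frame, dtype=float), fs, f_min, f_max)
    x = x - np.mean(x)
    # Keep only non-negative lags of the (biased) autocorrelation.
    ac = np.correlate(x, x, mode="full")[len(x) - 1:]
    if ac[0] <= 0.0:
        return 0.0
    lag_min = max(1, int(fs / f_max))
    lag_max = min(int(fs / f_min), len(ac) - 1)
    # The lag of the largest peak in the search range is taken as the pitch period.
    lag = lag_min + int(np.argmax(ac[lag_min:lag_max + 1]))
    return fs / lag


if __name__ == "__main__":
    fs = 8000
    t = np.arange(800) / fs
    # Synthetic voiced-like frame: 100 Hz fundamental plus one harmonic.
    frame = np.sin(2 * np.pi * 100 * t) + 0.5 * np.sin(2 * np.pi * 200 * t)
    print(estimate_pitch(frame, fs))
```

Restricting the band before computing the autocorrelation removes high-frequency noise that would otherwise produce spurious peaks, which is one plausible reason such a method suits noisy esophageal speech. A practical system would also need a voicing decision and smoothing across frames.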

Cited by 23 publications (18 citation statements)
References 6 publications
“…There have been some attempts based on modifications of its acoustic features, e.g., using comb filtering [1] or smoothing of acoustic parameters [2]. Although they have some efficacy in esophageal speech enhancement, it is basically difficult to compensate for the acoustic differences using those simple modification processes since the acoustic features of esophageal speech exhibit quite different properties from those of normal speech.…”
Section: Introduction
confidence: 99%
“…Other attempts to enhance the pathological voices, based on the modification of their acoustic features, e.g. by using comb filtering [15], auditory masking [16], and formant synthesis [17], have been proposed. Although these techniques are useful to improve the quality of pathological speech, it is in practice difficult for them to compensate for the acoustic feature differences between pathological and laryngeal speech.…”
Section: Previous and Current Research On Enhancing Pathological
confidence: 99%
“…This means that ψ (t) can be used to analyze and then reconstruct a signal without loss of information [19]. That is the functions given by (6) constitute an unconditional basis in L 2 (R) [19]; and then we can estimate the expansion coefficients of an audio signal f(t) by using the scalar product between f(t) and the function ψ(t) with translation τ and scaling factor s as follows [19]:…”
Section: Feature Vector Extraction
confidence: 99%
“…The critical bands theory models the basilar membrane operation as a filter bank in which the bandwidth of each filter increases as its central frequency also increases [6,7]. This requirement can be satisfied using the Bark frequency scale that is a logarithmic scale in which the frequency resolution of any section of the basilar membrane is exactly equal to one Bark, regardless of its characteristic frequency.…”
Section: Feature Vector Extraction
confidence: 99%
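
To make the Bark scale in the statement above concrete, the sketch below converts frequency in Hz to Bark using the widely used Zwicker and Terhardt approximation. The citing paper does not say which formula it uses, so this choice and the name `hz_to_bark` are assumptions. Under this approximation the scale is close to linear at low frequencies and only roughly logarithmic higher up.

```python
import numpy as np


def hz_to_bark(f_hz):
    """Zwicker-Terhardt approximation of the Bark scale (assumed formula, see lead-in)."""
    f = np.asarray(f_hz, dtype=float)
    return 13.0 * np.arctan(0.00076 * f) + 3.5 * np.arctan((f / 7500.0) ** 2)


if __name__ == "__main__":
    # Equal steps in Hz map to shrinking steps in Bark as frequency rises.
    for f in (100, 500, 1000, 2000, 4000, 8000):
        print(f, round(float(hz_to_bark(f)), 2))
```

Spacing filter centers uniformly in Bark gives bandwidths in Hz that grow with center frequency, which matches the filter-bank behavior the statement describes.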