2010
DOI: 10.1109/tbme.2010.2053369

Reconstruction of Normal Sounding Speech for Laryngectomy Patients Through a Modified CELP Codec

Abstract: Whispered speech can be useful for quiet and private communication, and is the primary means of unaided spoken communication for many people experiencing voice-box deficiencies. Patients who have undergone partial or full laryngectomy are typically unable to speak anything more than hoarse whispers, without the aid of prostheses or specialized speaking techniques. Each of the current prostheses and rehabilitative methods for post-laryngectomized patients (primarily oesophageal speech, tracheo-esophageal punctu…

Cited by 67 publications (42 citation statements)
References 24 publications

“…On the other hand, the latter method is capable of significantly improving naturalness by converting acoustic parameters of EL speech into those of natural voices using statistical VC techniques [5], [6]. The use of statistics extracted from a parallel data set consisting of EL speech and natural voices makes it possible to achieve more complex conversion processes than that of other signal processing approaches, such as formant manipulation [7]. For example, it is possible to convert from a spectral parameter sequence of EL speech into F0 patterns of natural voices.…”
Section: Introduction (mentioning)
confidence: 99%
“…Couple of methods are available for converting whispers to normal speech [19], [20], [21], [22], [23]. The driving idea of all these methods is based on the assumption of whispers are missing some acoustic and spectral features comparing with normal speech; hence, the problem of converting whispers to normal speech is formalised as a reconstruction issue [4], [24].…”
Section: Introduction (mentioning)
confidence: 99%
“…These reconstruction methods (either training-based or nontraining) have different disadvantages such as problems in converting continuous speech (due to using phoneme switching) [20], being computationally expensive (due to using highly overlapped frames for spectral enhancement, or using jump Markov linear system for pitch and voicing parameters) [19], [4], and more importantly lack of naturalness in regenerated output (due to simplified time alignment and spectral features assumptions) [21], [23]. In this paper, we focus on a training-based approach, and propose a novel reconstruction algorithm to improve the efficiency in phonated speech regeneration.…”
Section: Introduction (mentioning)
confidence: 99%
“…However, whispering is usually too weak in volume to be practical in everyday conversation. To overcome these problems researchers have also attempted to capture whispered speech and re-synthesize normal speech externally [8]. However, this approach is sensitive to background noise in the environment.…”
Section: Introduction (mentioning)
confidence: 99%