2000
DOI: 10.1016/s0167-6393(99)00031-x
|View full text |Cite
|
Sign up to set email alerts
|

Estimation of stability and accuracy of inverse problem solution for the vocal tract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2003
2003
2023
2023

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(10 citation statements)
references
References 55 publications
0
10
0
Order By: Relevance
“…[17][18][19][20][21] Sorokin et al 17 chose a regularizing term that prevents inverse solutions from deviating too much from the neutral position of articulators. Schroeter and Sondhi 18 presented a method based on dynamic programming ͑DP͒ to search articulatory codebooks with a penalty factor for large "articulatory efforts," that is, fast changes in the vocal tract so that the estimated articulator trajectories are smoothly evolving.…”
Section: Introductionmentioning
confidence: 99%
“…[17][18][19][20][21] Sorokin et al 17 chose a regularizing term that prevents inverse solutions from deviating too much from the neutral position of articulators. Schroeter and Sondhi 18 presented a method based on dynamic programming ͑DP͒ to search articulatory codebooks with a penalty factor for large "articulatory efforts," that is, fast changes in the vocal tract so that the estimated articulator trajectories are smoothly evolving.…”
Section: Introductionmentioning
confidence: 99%
“…categories: analysis-by-synthesis approaches and datadriven approaches. Several articulatory inversion methods are based on the analysis-by-synthesis approach, which is a closed-loop optimization procedure that involves the comparison of the spectrum of synthesized speech to the measured speech at consecutive frames, for example [4][5][6][7][8].…”
Section: Databasementioning
confidence: 99%
“…(6), can also be expressed as a GMM as follows, (6) where the parameter in Eq. (6) corresponding to conditional mean is calculated by Eq.…”
Section: Acoustic-to-articulatory Mapping Approachmentioning
confidence: 99%
“…In the first place, a full three level WPT decomposition is performed, which splits the frequency components within the range [0, 8] kHz into eight bands; where each is of 1 kHz bandwidth approximately. Then, energy localized in the bands [4,5] kHz, [5,6] kHz, [6,7] kHz, and [7,8] Likewise, to accomplish the time plane partition, the acoustic speech signal is parameterized using 20 ms frames and ∆t = 10 ms steps, so a rate frame of 100 Hz is performed [16]. Acoustic information within time interval ranging from t − t a = t − 200 ms to t + t b = t + 300 ms is parameterized.…”
Section: Acoustic Representation Based On Wavelet Packet Transformmentioning
confidence: 99%
“…The question of how the articulatory information, which come from Electro-Magnetic Articulograph (EMA) systems in present work, is coded in the speech signal remains of practical and theoretical relevance. In particular, the knowledge of the distribution of the articulatory influence on the acoustic speech signal is useful in those applications involving articulatory inversion tasks, whose main goal is to infer the articulators position based on the information immersed in the acoustic speech signal [7], [8].…”
Section: Introductionmentioning
confidence: 99%