Learning vocal tract variables with multi-task kernels

Kadri, Hachem; Duflos, Emmanuel; Preux, Philippe

doi:10.1109/icassp.2011.5946917

Cited by 2 publications

(8 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The problem of speech inversion has received increasing attention in the speech processing community in the recent years (see Schroeter and Sondhi (1994); Mitra et al (2010); Kadri et al (2011a) and references therein). This problem, aka acoustic-articulatory inversion, involves inverting the forward process of speech production (see Figure 4).…”

Section: Speech Inversionmentioning

confidence: 99%

“…In most cases, these works address the articulatory estimation problem within a single-task learning perspective. However, in Richmond (2007) and more recently in Kadri et al (2011a), the authors put forward the idea that we can benefit from viewing the acoustic-articulatory inversion problem from a multi-task learning perspective. Motivated by comparing our functional operator-valued kernel based approach with multivariate kernel methods, we report on experiments similar to those performed by Mitra et al (2009) and Kadri et al (2011a).…”

Section: Speech Inversionmentioning

confidence: 99%

“…However, in Richmond (2007) and more recently in Kadri et al (2011a), the authors put forward the idea that we can benefit from viewing the acoustic-articulatory inversion problem from a multi-task learning perspective. Motivated by comparing our functional operator-valued kernel based approach with multivariate kernel methods, we report on experiments similar to those performed by Mitra et al (2009) and Kadri et al (2011a). The tract variables learning technique proposed by Mitra et al (2009) is based on a hierarchical ε-SVR architecture constructed by associating different SVRs, a SVR for each tract variable.…”

Section: Speech Inversionmentioning

confidence: 99%

“…To consider the dependencies between VT time functions, the SVRs corresponding to independent VT variables are first created and then used for constructing the others. Otherwise, the acousticto-articulatory method in Kadri et al (2011a) is based on learning a vector-valued function using a matrix-valued kernel proposed in Caponnetto et al (2008).…”

Section: Speech Inversionmentioning

confidence: 99%

See 3 more Smart Citations

Operator-valued Kernels for Learning from Functional Response Data

Kadri,

Duflos,

Preux

et al. 2015

Preprint

Self Cite

View full text Add to dashboard Cite

In this paper 1 we consider the problems of supervised classification and regression in the case where attributes and labels are functions: a data is represented by a set of functions, and the label is also a function. We focus on the use of reproducing kernel Hilbert space theory to learn from such functional data. Basic concepts and properties of kernel-based learning are extended to include the estimation of function-valued functions. In this setting, the representer theorem is restated, a set of rigorously defined infinite-dimensional operatorvalued kernels that can be valuably applied when the data are functions is described, and a learning algorithm for nonlinear functional data analysis is introduced. The methodology is illustrated through speech and audio signal processing experiments.

show abstract

Section: Speech Inversionmentioning

confidence: 99%

Section: Speech Inversionmentioning

confidence: 99%

Section: Speech Inversionmentioning

confidence: 99%

Section: Speech Inversionmentioning

confidence: 99%

See 2 more Smart Citations

Operator-valued Kernels for Learning from Functional Response Data

Kadri,

Duflos,

Preux

et al. 2015

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…Although several attempts have been made during more than thirty years, the speech researchers still regard the acoustic-to-articulatory inversion as an open issue [2][3][4]. Roughly, inversion methods can be divided into two This paper seeks to show that using VTLN (Vocal Tract Length Normalization) in conjunction with statistically relevant parameters produces effective results for the case of acoustic-to-articulatory inversion in a speakerindependent way.…”

Section: Introductionmentioning

confidence: 99%