2009 IEEE International Conference on Acoustics, Speech and Signal Processing 2009
DOI: 10.1109/icassp.2009.4960496
|View full text |Cite
|
Sign up to set email alerts
|

Data-driven voice soruce waveform modelling

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
15
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
6
1
1

Relationship

3
5

Authors

Journals

citations
Cited by 21 publications
(16 citation statements)
references
References 15 publications
1
15
0
Order By: Relevance
“…In speech analysis nomenclature, these timing instants are called glottal closure instants (GCIs) and glottal opening instants (GOIs). Applications of GCI and GOI estimation are numerous, including pitch tracking [1], [2], voice source modeling [3]- [6], speech enhancement [7], closedphase analysis and glottal flow estimation [8]- [11], speaker identification [9], [12], [13], speech dereverberation [14], speech synthesis [15], [16], speech coding [17], speech modification [18], [19] and speech transformations [20].…”
Section: Introductionmentioning
confidence: 99%
“…In speech analysis nomenclature, these timing instants are called glottal closure instants (GCIs) and glottal opening instants (GOIs). Applications of GCI and GOI estimation are numerous, including pitch tracking [1], [2], voice source modeling [3]- [6], speech enhancement [7], closedphase analysis and glottal flow estimation [8]- [11], speaker identification [9], [12], [13], speech dereverberation [14], speech synthesis [15], [16], speech coding [17], speech modification [18], [19] and speech transformations [20].…”
Section: Introductionmentioning
confidence: 99%
“…For this reason, existing ABWE approaches often avoid lowband extension altogether. This paper presents a novel method for the extension of narrowband source signals based on an existing spectral mirroring technique and Data-Driven Voice Source Modelling (DDVSM) [7], employing GMMs to establish an explicit mapping between narrowband source features and the wideband source signal. Using an existing ABWE framework [5] that applies HMM-based Bayesian estimation of spectral and temporal envelopes [1], missing frequency content in both high and low bands is synthesized and added to the narrowband signal to form an estimated wideband signal.…”
Section: Introductionmentioning
confidence: 99%
“…Data-Driven Voice Source Modelling (DDVSM) [7] is a technique for classifying voice source signals. One such implementation uses a large database of training data to estimate class distributions in the MFCC feature space, from which a set of corresponding 'prototype' time-domain waveforms are derived.…”
Section: Introduction To Data-driven Voice Source Modellingmentioning
confidence: 99%
“…Other techniques jointly estimate and [30] that are not considered here. Re-writing (1) in the time domain (3) where are the prediction coefficients, is an estimate of , and is the prediction order. The vocal tract transfer function can be approximated as (4) The prediction order for an adult male of vocal tract length 17 cm is approximately , where is the sampling frequency.…”
Section: B Estimation By Linear Predictionmentioning
confidence: 99%
“…The detection of GCIs in voiced speech is important for glottal-synchronous speech processing algorithms such as pitch tracking, prosodic speech modification [1], speech dereverberation [2], data-driven voice source modeling [3] and areas of speech synthesis [4]. Identification of GOIs is necessary for closed-phase linear predictive coding (LPC) [5] and the analysis of pathological speech that relies upon knowledge of the open quotient (OQ) [6].…”
mentioning
confidence: 99%