2016
DOI: 10.1109/taslp.2015.2506263
|View full text |Cite
|
Sign up to set email alerts
|

A Fast Method for High-Resolution Voiced/Unvoiced Detection and Glottal Closure/Opening Instant Estimation of Speech

Abstract: Abstract-We propose a fast speech analysis method which simultaneously performs high-resolution voiced/unvoiced detection (VUD) and accurate estimation of glottal closure and glottal opening instants (GCIs and GOIs, respectively). The proposed algorithm exploits the structure of the glottal flow derivative in order to estimate GCIs and GOIs only in voiced speech using simple time-domain criteria. We compare our method with well-known GCI/GOI methods, namely, the dynamic programming projected phase-slope algori… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
11
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 30 publications
(11 citation statements)
references
References 46 publications
(72 reference statements)
0
11
0
Order By: Relevance
“…In state of the art, the researchers had proposed many methods and algorithms to determine the instants from EGG [1, 2, 9] and speech/acoustic signal [3–5, 10–13]. The main disadvantages of the existing methods were less accuracy in detection of GCIs and false detection of GOIs for the vulnerable cases of voicing.…”
Section: Introductionmentioning
confidence: 99%
“…In state of the art, the researchers had proposed many methods and algorithms to determine the instants from EGG [1, 2, 9] and speech/acoustic signal [3–5, 10–13]. The main disadvantages of the existing methods were less accuracy in detection of GCIs and false detection of GOIs for the vulnerable cases of voicing.…”
Section: Introductionmentioning
confidence: 99%
“…From the studies in [18], it was observed that most of the epoch detection methods were shown to provide good accuracy on the speech data collected in the lab environments. Also, some attempts were made to see the effectiveness of these methods for additive noise degraded conditions [19][20][21][22][23]. However, there are not many attempts in GCI detection for the degraded conditions like telephone quality speech.…”
Section: Introductionmentioning
confidence: 99%
“…YAGA is one such method [30], which uses the glottal flow waveform, wavelet transform, group delay and dynamic programming. Also, recently a method was proposed which uses the glottal flow waveform signal with the time domain criteria for detecting GCIs by forward-backward algorithm in [20].…”
Section: Introductionmentioning
confidence: 99%
“…We can broadly (not exhaustively) classify the available GCI detection methods into i) classical signal processing [20,21,22,23,2,24,25,26] and ii) most recent classification based data driven [4,27,28] approaches. Most of the popular signal processing GCI detection methods relies on designing signal processing pipelines to obtain the exemplary signal which emphasizes the locations of GCIs in the speech signal [28].…”
Section: Introductionmentioning
confidence: 99%
“…Further, GCIs from the exemplary signal is obtained from hand-crafted heuristics. Two approaches are popular for exemplary signal extraction: a) source/filter modeling to extract the linear prediction residual whose peaks corresponds to the candidate epochs [20,21,22,23]. b) Other methods which rely on the properties of excitation signal such as impulse nature [2,24,25,26] to obtain the exemplary signal.…”
Section: Introductionmentioning
confidence: 99%