2017
DOI: 10.1109/taslp.2017.2666425

Extraction of Fundamental Frequency From Degraded Speech Using Temporal Envelopes at High SNR Frequencies

citations
Cited by 22 publications
(24 citation statements)
references
References 24 publications
“…SFF has high resolution in terms of frequency leading to sharp harmonics utilized for the extraction of fundamental frequency. 32,33 The discrete-time speech signal denoted by s(n) is differenced, and the differenced signal is denoted by x(n) = s(n) − s(n − 1). The sampling frequency is Fs.…”
Section: Proposed Methods (mentioning)
confidence: 99%
“…The output of SFF at each stage of frequency has large SNR areas employed to develop the speech and nonspeech region detection. SFF has high resolution in terms of frequency leading to sharp harmonics utilized for the extraction of fundamental frequency. 32,33 The discrete-time speech signal denoted by s(n) is differenced, and the differenced signal is denoted by x(n) = s(n) − s(n − 1).…”
Section: Proposed Methods (mentioning)
confidence: 99%
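
The differencing step quoted above, x(n) = s(n) − s(n − 1), can be illustrated with a minimal sketch. This is not code from the cited papers: the function name `difference_signal` and the convention s(−1) = 0 for the first sample are assumptions made here for illustration.

```python
import numpy as np


def difference_signal(s):
    """Return x(n) = s(n) - s(n-1) for a 1-D signal s.

    Assumption: s(-1) is taken as 0, so x(0) = s(0) and the output
    has the same length as the input.
    """
    s = np.asarray(s, dtype=float)
    # prepend=0.0 supplies the assumed s(-1) = 0 boundary value
    return np.diff(s, prepend=0.0)


# A constant (DC) offset vanishes after the first sample,
# which is the usual motivation for differencing a speech signal.
x = difference_signal([5.0, 5.0, 5.0, 5.0])
```

Differencing acts as a simple first-order high-pass filter, so slowly varying components of s(n) are attenuated before further processing.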
“…It is also shown that the optimal choice of F depends on the frame length and the harmonic order [13]. However, for simplicity and fast implementation, in this paper, we set F = 2^14. The state space for the discrete variables can be expressed as…”
Section: The State Evolution Model (mentioning)
confidence: 99%
“…Using (9), (12), (13), (14), (19) and (20), a closed-form marginal likelihood can be obtained, i.e., p(y_n | ẍ_n, Y_{n−1})…”
Section: Pitch Tracking (mentioning)
confidence: 99%
“…The presence of high SNR regions in the SFF outputs was exploited for speech and nonspeech detection, after suitably compensating for the noise in the degraded speech signal [13]. The SFF method was also used for extracting GCIs [14], locating burst onsets [15] and fundamental frequency extraction [16,17]. The significance of the phase of SFF output of speech is also examined recently in [18].…”
Section: Introduction (mentioning)
confidence: 99%