Robust Estimation of Fundamental Frequency Using Single Frequency Filtering Approach

Pannala, Vishala; Aneeja, G.; Kadiri, Sudarsana Reddy; Yegnanarayana, B.

doi:10.21437/interspeech.2016-1401

Cited by 19 publications

(15 citation statements)

References 24 publications

“…Performance of the proposed method is compared with eight standard methods. The eight standard methods are SWIPE [32], YIN [9], RAPT [7], SHRP [16], YAAPT [8], SRH [15], PEFAC [19] and SFF-CEP [33]. For all the methods, the F0 search range was set to 60–1500 Hz according to the study on singing voice in [3].…”

Section: Methods For Comparison (mentioning)

confidence: 99%

Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source

Kadiri and Yegnanarayana (2018). Interspeech 2018.

Self Cite

This paper focuses on the problem of estimating fundamental frequency from singing voice. Estimation of fundamental frequency is a well-studied topic in the speech research community. Recent studies that apply state-of-the-art speech methods to singing voice show a significant gap in accuracy. This is mainly because pitch varies more widely and more rapidly in singing voice than in speech. To overcome this, in this paper we propose a method to derive the fundamental frequency from singing voice by exploiting the harmonics of impulse-like excitation in a sequence of glottal cycles. The proposed method is compared with eight state-of-the-art methods (YIN, SWIPE, YAAPT, RAPT, SRH, SFF-CEP, PEFAC and SHRP) on the LYRICS singing database. From the experimental results, it is observed that the accuracy of fundamental frequency estimation by the proposed method is better than that of many state-of-the-art methods in various singing categories and laryngeal mechanisms.

“…The SFF method is used to derive the amplitude envelope of the speech signal at every sample for a given frequency [32]. The SFF spectrum has been shown to be useful in finding burst-onset points [29] and glottal closure instants [30], and it has been demonstrated to exhibit high spectral resolution for important speech features such as harmonics and resonances [27].…”

Section: A SFF (mentioning)

confidence: 99%
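
The citation statement above describes SFF as yielding an amplitude envelope at every sample for a chosen frequency. A minimal sketch of that idea follows. It assumes the formulation commonly given in the SFF literature rather than anything stated on this page: the signal is frequency-shifted so that the chosen frequency lands at half the sampling rate, then passed through a single-pole filter whose pole sits at z = -r, close to the unit circle. The function name `sff_envelope` and the default `r = 0.995` are illustrative assumptions, not values taken from the cited papers.

```python
import numpy as np


def sff_envelope(signal, fs, freq_hz, r=0.995):
    """Per-sample amplitude envelope of `signal` at `freq_hz` (hypothetical helper).

    Assumed formulation: shift freq_hz to fs/2, then filter with a single
    pole at z = -r. The value of r is an assumption chosen for illustration.
    """
    signal = np.asarray(signal, dtype=float)
    n = np.arange(len(signal))

    # Frequency shift: the component at freq_hz moves to normalized frequency pi.
    shift = np.pi - 2.0 * np.pi * freq_hz / fs
    x = signal * np.exp(1j * shift * n)

    # Single-pole recursion y[n] = -r * y[n-1] + x[n], which has high gain at pi.
    y = np.zeros(len(signal), dtype=complex)
    prev = 0.0 + 0.0j
    for i in range(len(signal)):
        prev = -r * prev + x[i]
        y[i] = prev

    # The envelope is the magnitude of the complex filter output.
    return np.abs(y)


if __name__ == "__main__":
    fs = 8000
    t = np.arange(4000) / fs
    tone = np.cos(2 * np.pi * 200 * t)  # synthetic 200 Hz test tone
    for f in (100.0, 200.0, 400.0):
        env = sff_envelope(tone, fs, f)
        print(f, env[-1000:].mean())
```

Running the script on the synthetic 200 Hz tone should show a much larger steady-state envelope at 200 Hz than at the neighbouring frequencies. Repeating the computation over a grid of frequencies gives an SFF spectrum at every sample, which is the representation the citing papers refer to.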

“…This architecture was chosen in the current study because it was shown in [25] to be the best performing system in dialect classification compared to two reference techniques. The spectrum computed by single frequency filtering (SFF) has been shown to give good spectral resolution to indicate harmonics and resonances [27] and good temporal resolution to model speech excitation features such as impulse-like events [28]. The SFF spectrum has also shown promising performance in determining burst-onset points related to voice-onset time (VOT) and glottal closure instances compared to the short-time Fourier transform (STFT) spectrum [28]-[30].…”

Section: Introduction (mentioning)

confidence: 99%

Mel-Weighted Single Frequency Filtering Spectrogram for Dialect Identification

et al. 2020

Self Cite

“…The instantaneous energy for a speech segment (Figure 2 (a)) is shown in Figure 2 (c). The equation for instantaneous energy E[n] is given below [28],…”

Section: Parameters Used For Feature Extraction (mentioning)

confidence: 99%

Detection of Replay Attacks Using Single Frequency Filtering Cepstral Coefficients

Alluri, Achanta, Kadiri, et al. (2017). Interspeech 2017.

Self Cite

Automatic speaker verification systems are vulnerable to spoofing attacks. Recently, various countermeasures have been developed for detecting high technology attacks such as speech synthesis and voice conversion. However, there is a wide gap in dealing with replay attacks. In this paper, we propose a new feature for replay attack detection based on single frequency filtering (SFF), which provides high temporal and spectral resolution at each instant. Single frequency filtering cepstral coefficients (SFFCC) with Gaussian mixture model classifier are used for the experimentation on the standard BTAS-2016 corpus. The previously reported best result, which is based on constant Q cepstral coefficients (CQCC) achieved a half total error rate of 0.67 % on this data-set. Our proposed method outperforms the state of the art (CQCC) with a half total error rate of 0.0002 %.
