Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-2495
|View full text |Cite
|
Sign up to set email alerts
|

Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source

Abstract: This paper focuses on the problem of estimating fundamental frequency from singing voice. Estimation of fundamental frequency is a well studied topic in the speech research community. From the recent studies on fundamental frequency estimation from singing voice with state-of-art methods proposed for speech, there exists a significant gap in accuracy for singing voice. This is mainly because of the wider and rapid variations in pitch in singing voice compared to that in speech. To overcome this, in this paper … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 29 publications
(40 reference statements)
0
6
0
Order By: Relevance
“…As F0 extraction is covered in several tutorials/books, this topic is not handled in detail in this review article, but we, instead, discuss the general aspects of F0 extraction briefly here and focus more on recent deep learning-based progress of the topic in Section VI-A. For more details on F0 extraction, please see [96]- [104], where various methods are reviewed by for the study of clean and noisy speech, as well as singing voices.…”
Section: B Extraction Of Fmentioning
confidence: 99%
See 3 more Smart Citations
“…As F0 extraction is covered in several tutorials/books, this topic is not handled in detail in this review article, but we, instead, discuss the general aspects of F0 extraction briefly here and focus more on recent deep learning-based progress of the topic in Section VI-A. For more details on F0 extraction, please see [96]- [104], where various methods are reviewed by for the study of clean and noisy speech, as well as singing voices.…”
Section: B Extraction Of Fmentioning
confidence: 99%
“…This property forms the basis for frequencydomain methods. Examples of methods belonging to this category are the SHRP [110], the SRH [111], the summation of impulse-sequence harmonics [104], the method of dominant harmonics [112], and the SWIPE [113].…”
Section: B Extraction Of Fmentioning
confidence: 99%
See 2 more Smart Citations
“…Identification of epoch locations plays a crucial role in many speech processing applications such as speech modification [1], excitation source modeling [2], inverse filtering [3,4], joint optimization in concatenative speech synthesis [5], speech pathology [6,7], etc. Apart from above applications, the high SNR property of the GCI was used in applications like glottal activity detection [8], pitch tracking [9,10], formant frequencies [11], analysis and detection of phonation types [12,13] and emotions [14,15], speaker recognition [16], speech enhancement [8], multi-speaker separation, identification of number of speakers from multi-speakers data [17] etc. Due to wider range of applications, GCI detection has received a considerable amount of research attention.…”
Section: Introductionmentioning
confidence: 99%