2014
DOI: 10.1109/taslp.2014.2352453
|View full text |Cite
|
Sign up to set email alerts
|

BaNa: A Noise Resilient Fundamental Frequency Detection Algorithm for Speech and Music

Abstract: Abstract-Fundamental frequency (F0) is one of the essential features in many acoustic related applications. Although numerous F0 detection algorithms have been developed, the detection accuracy in noisy environments still needs improvement. We present a hybrid noise resilient F0 detection algorithm named BaNa that combines the approaches of harmonic ratios and Cepstrum analysis. A Viterbi algorithm with a cost function is used to identify the F0 value among several F0 candidates. Speech and music databases wit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
29
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 24 publications
(33 citation statements)
references
References 34 publications
(31 reference statements)
0
29
0
Order By: Relevance
“…In this case, even if the speech signal is corrupted by noise, the affect of noise is suppressed. This is the motivation in [14,15], where a long length of N is used commonly. In this paper, we consider to use the Rectangular window instead of the Hanning and Hamming windows, and keep a standard length of N .…”
Section: Motivationmentioning
confidence: 99%
See 2 more Smart Citations
“…In this case, even if the speech signal is corrupted by noise, the affect of noise is suppressed. This is the motivation in [14,15], where a long length of N is used commonly. In this paper, we consider to use the Rectangular window instead of the Hanning and Hamming windows, and keep a standard length of N .…”
Section: Motivationmentioning
confidence: 99%
“…Reliable detection of the pitch period (T 0 ) being the inverse of the fundamental frequency (F 0 ) from speech is required in a wide range of applications such as speech coding, speech recognition, speech enhancement, speech synthesis, and so on. Therefore, a large number of pitch detection methods have been addressed up to now [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17][18][19].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…It is known that source filter decomposition through LP analysis fails drastically, for the voices such as singing voice, due to significant coupling between source and system. Some methods combine various techniques such as combination of harmonic ratios and cepstral analyses [18,19]. In methods that use time-frequency domain, the speech signal is decomposed into multiple frequency bands and then time domain methods such as autocorrelation are applied on each of the sub bands [20].…”
Section: Introductionmentioning
confidence: 99%
“…Examples of this kind are sub-harmonics to harmonics ratio (SHRP) [11], summation of residual harmonics [12], dominant harmonics [13], sawtooth waveform inspired pitch estimator (SWIPE) [14], etc. Also, some approaches [15,16] combine various techniques of F0 estimation (e.g., [15] combines harmonic ratios and cepstral analysis) for robustness under degraded conditions.…”
Section: Introductionmentioning
confidence: 99%