ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683653

A Study on How Pre-whitening Influences Fundamental Frequency Estimation

Abstract: This paper deals with the influence of pre-whitening for the task of fundamental frequency estimation in noisy conditions. Parametric fundamental frequency estimators commonly assume that the noise is white and Gaussian and, therefore, they are only statistically efficient under those conditions. The noise is coloured in many practical applications and this will often result in problems of misidentifying an integer divisor or multiple of the true fundamental frequency (i.e., octave errors). The purpose of this…

Citations: cited by 9 publications (11 citation statements)
References: 23 publications
“…The harmonic distortion of a DUT can distort the original harmonic structure of the input signal, and cause performance degradation by affecting the balance of Mel-frequency cepstrum coefficient (MFCC) [36][37][38]. Then, the noise of the DUT will flood the harmonic structural features to some extent, making it more difficult to be extracted [39]. Therefore, the SNR, SINAD, THD and ADDR performance of the DUTs indirectly affect the accuracy of ASR.…”
Section: Simulation (mentioning)
confidence: 99%
“…Therefore, a prewhitening step is required to deal with the inconsistency between the white Gaussian noise model assumption and real life noise model. A linear prediction (LP) based prewhitening step is applied to each frame to deal with the non-white Gaussian noise (see [9], [43] for detail). The power spectral density (PSD) of the noise given noisy signals is estimated using the minimum mean-square error (MMSE) estimator [44].…”
Section: Prewhitening (mentioning)
confidence: 99%
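
The LP-based prewhitening step mentioned in the statement above can be sketched roughly as follows. This is a minimal illustration, not the cited authors' implementation: it assumes a noise-only segment is available (the cited work instead estimates the noise PSD with an MMSE estimator), and the function names `lp_coefficients`, `prewhiten` and `spectral_flatness`, the LP order of 4, and the AR(1) test noise are all hypothetical choices made for the example.

```python
import numpy as np

def lp_coefficients(noise, order):
    """Estimate the prediction-error filter [1, a1, ..., ap] via the Yule-Walker equations."""
    noise = np.asarray(noise, dtype=float)
    n = len(noise)
    # Biased autocorrelation estimates r[0..order]
    r = np.array([np.dot(noise[: n - k], noise[k:]) / n for k in range(order + 1)])
    # Toeplitz autocorrelation matrix with entries R[i, j] = r[|i - j|]
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    a = np.linalg.solve(r[idx], -r[1:])
    return np.concatenate(([1.0], a))

def prewhiten(frame, a):
    """Filter one frame with the FIR prediction-error filter A(z)."""
    return np.convolve(frame, a)[: len(frame)]

def spectral_flatness(x):
    """Geometric mean over arithmetic mean of the periodogram (higher means flatter)."""
    p = np.abs(np.fft.rfft(x)) ** 2 + 1e-12
    return np.exp(np.mean(np.log(p))) / np.mean(p)

rng = np.random.default_rng(0)
white = rng.standard_normal(4096)

# Hypothetical coloured noise: white noise through a one-pole filter (pole at 0.9)
coloured = np.zeros_like(white)
coloured[0] = white[0]
for i in range(1, len(white)):
    coloured[i] = 0.9 * coloured[i - 1] + white[i]

a = lp_coefficients(coloured, order=4)   # assumed LP order
whitened = prewhiten(coloured, a)

print("flatness before:", spectral_flatness(coloured))
print("flatness after: ", spectral_flatness(whitened))
```

In a frame-based system the same `prewhiten` call would be applied to each noisy frame, with the coefficients re-estimated whenever the noise estimate is updated.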
“…One example is when sub-harmonic errors appear when estimating the fundamental frequency (a.k.a. pitch) of voiced speech segments [6], [7] from estimators which assume WGN. A preprocessor which renders the coloured noise closer to white, namely a pre-whitener, can alleviate this problem.…”
Section: Introduction (mentioning)
confidence: 99%
“…A preprocessor which renders the coloured noise closer to white, namely a pre-whitener, can alleviate this problem. Applying pre-whitening using a linear filter is advantageous compared to a general linear transformation with, e.g., the Cholesky factor [4], since the effect of linear filtering can be modeled by only changing the sinusoidal amplitudes and phases [6], [7]. Unlike general linear transformations, linear filtering thus enables us to use many existing model-based estimators based on a WGN assumption.…”
Section: Introduction (mentioning)
confidence: 99%
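
The claim that linear filtering only changes the sinusoidal amplitudes and phases can be checked numerically. The sketch below is an illustration under assumed values: the three-harmonic signal, the fundamental frequency, and the FIR taps in `h` are hypothetical and are not taken from the cited papers. After the filter's start-up transient, the filtered signal should match a harmonic signal at the same frequencies, with each amplitude scaled by |H(lω0)| and each phase shifted by the angle of H(lω0).

```python
import numpy as np

def harmonic_signal(n, w0, amps, phases):
    """Sum over harmonics l of amps[l-1] * cos(l * w0 * n + phases[l-1])."""
    l = np.arange(1, len(amps) + 1)[:, None]
    amps = np.asarray(amps)[:, None]
    phases = np.asarray(phases)[:, None]
    return np.sum(amps * np.cos(l * w0 * n + phases), axis=0)

def freq_response(h, w):
    """Frequency response H(w) = sum_k h[k] * exp(-j * w * k) of an FIR filter."""
    k = np.arange(len(h))
    return np.array([np.sum(h * np.exp(-1j * wi * k)) for wi in np.atleast_1d(w)])

n = np.arange(400)
w0 = 2 * np.pi * 0.05                # assumed fundamental frequency (rad/sample)
amps = np.array([1.0, 0.5, 0.25])    # assumed harmonic amplitudes
phases = np.array([0.3, -1.0, 2.0])  # assumed harmonic phases
h = np.array([1.0, -0.9, 0.2])       # hypothetical FIR pre-whitening filter

x = harmonic_signal(n, w0, amps, phases)
y = np.convolve(x, h)[: len(n)]      # filtered signal

# Same harmonic model, with amplitudes and phases modified by the filter response
H = freq_response(h, w0 * np.arange(1, len(amps) + 1))
y_model = harmonic_signal(n, w0, amps * np.abs(H), phases + np.angle(H))

start = len(h) - 1                   # skip the filter's start-up transient
max_err = np.max(np.abs(y[start:] - y_model[start:]))
print("max deviation from harmonic model:", max_err)
```

Because the filtered signal still follows the harmonic model, an estimator derived under a WGN assumption can be applied to it unchanged, which is the advantage the statement attributes to linear filtering over a general linear transformation.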