In this paper, we present a pitch detection algorithm that is extremely robust for both high quality and telephone speech.The kernel method for this algorithm is the ''NCCF or Normalized Cross Correlation" reported by David Talkin [IJ.Major innovations include: processing of the original acoustic signal and a nonlinearly processed version of the signal to partially restore very weak FO components; intelIigent peak picking to select multiple FO candidates and assign merit factors; and, incotporation of highly robust pitch contours obtained from smoothed versions of low frequency portions of spectrograms. Dynamic programming is used to fmd the "best"pitch track among all the candidates, using both local and transition costs. We evaluated our algorithm using the Keele pitch extraction reference database as "ground truth" for both "high quality" and ' 'telephone'' speech. For both types of speech, the error rates obtained are lower than the lowest reported in the literature. low frequency spectrograms are shown in the middle panel (original signal) and bottom panel (absolute value signal). The two curves overlaid on each spectrogram are explained in detaillater. This figure illustrates that FO is much more prominent in the lower panel than the middle panel (also verified by comparison with TIMIT version of same sentence). Similar effects were noted for many other sample signals, some studio quality as well. The strategy adopted was to completely process 0-7803-7402-9/02/$17 .00 «:l2002 IEEE 1-361
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.