2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017
DOI: 10.1109/icassp.2017.7953229
|View full text |Cite
|
Sign up to set email alerts
|

Towards confidence measures on fundamental frequency estimations

Abstract: The fundamental frequency is one of the prosodic parameters, and many algorithms have been developed for estimating the fundamental frequency of speech signals. Most of them provide good results on good quality speech signals, but their performance degrades when dealing with noisy signals. Moreover, although some provide a probability for the voicing decision, none of them indicate how reliable the estimated fundamental frequency is. In this paper, we investigate the computation of a confidence (or reliability… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 10 publications
0
4
0
Order By: Relevance
“…However, the confidence of the estimated pitch in the voiced case is seldom provided. A few exceptions are CREPE [14], which produces a confidence score computed from the activations of the last layer of the model, and [25], which directly addresses this problem, by training a neural network based on hand-crafted features to estimate the confidence of the estimated pitch. In contrast, in our work we explicitly augment the proposed model with a head aimed at estimating confidence in a fully unsupervised way.…”
Section: Related Workmentioning
confidence: 99%
“…However, the confidence of the estimated pitch in the voiced case is seldom provided. A few exceptions are CREPE [14], which produces a confidence score computed from the activations of the last layer of the model, and [25], which directly addresses this problem, by training a neural network based on hand-crafted features to estimate the confidence of the estimated pitch. In contrast, in our work we explicitly augment the proposed model with a head aimed at estimating confidence in a fully unsupervised way.…”
Section: Related Workmentioning
confidence: 99%
“…Currently there is no indication of the reliability of the estimated F0 values provided by the various pitch detection algorithms. Some preliminary work has been carried out in this direction [15], but further studies are still necessary.…”
Section: Fundamental Frequencymentioning
confidence: 99%
“…However, the confidence of the estimated pitch in the voiced case is seldom provided. A few exceptions are CREPE [12], which produces a confidence score computed from the activations of the last layer of the model, and [22], which directly addresses this problem, by training a neural network based on handcrafted features to estimate the confidence of the estimated pitch. In contrast, in our work we explicitly augment the proposed model with a head aimed at estimating confidence in a fully unsupervised way.…”
Section: Related Workmentioning
confidence: 99%