Interspeech 2011
DOI: 10.21437/interspeech.2011-317

A pitch tracking corpus with evaluation on multipitch tracking scenario

Citations: cited by 87 publications (16 citation statements)
References: 10 publications
“…For training, we used the TIMIT [9] and PTDB-TUG speech datasets [20]. During the training, different scenarios are simulated where either one or two sources are concurrently active, similar to [2].…”
Section: Training
mentioning
confidence: 99%
“…PTDB [31] is the dataset most commonly used in recent work on pitch estimation for speech. For this reason, we use PTDB as a representation of performance on speech data.…”
Section: A Data
mentioning
confidence: 99%
“…accurately predict the fundamental frequency of speech on the PTDB dataset [31], while estimators with larger receptive fields [29], [30] are able to. As we will show, CREPE has no difficulty learning accurate pitch on PTDB - when the atypical and undocumented alignment between the audio and pitch of PTDB is addressed - and the generalization gap is due to mismatched data distributions between training and evaluation.…”
mentioning
confidence: 99%
“…The four methods were evaluated on the PTDB-TUG database [15]. The database contains clean utterances from 20 speakers (10 males and 10 females).…”
Section: A Experimental Setup
mentioning
confidence: 99%