2021
DOI: 10.48550/arxiv.2109.07940
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription

Abstract: Automatic lyrics transcription (ALT), which can be regarded as automatic speech recognition (ASR) on singing voice, is an interesting and practical topic in academia and industry. ALT has not been well developed mainly due to the dearth of paired singing voice and lyrics datasets for model training.Considering that there is a large amount of ASR training data, a straightforward method is to leverage ASR data to enhance ALT training. However, the improvement is marginal when training the ALT system directly wit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 27 publications
(29 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?