2020
DOI: 10.1016/j.eswa.2020.113769
|View full text |Cite
|
Sign up to set email alerts
|

Data representations for audio-to-score monophonic music transcription

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
2

Relationship

2
6

Authors

Journals

citations
Cited by 20 publications
(8 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…In 2020, Miguel et al [2] developed an e2e technique depending on DNN for audio-to-score transcription of music from monophonic quotes. Here, an audio file was given as input that was modeled as a frame sequence, and a DNN was trained to provide a sequence of encoded music notes.…”
Section: Jazzmentioning
confidence: 99%
See 1 more Smart Citation
“…In 2020, Miguel et al [2] developed an e2e technique depending on DNN for audio-to-score transcription of music from monophonic quotes. Here, an audio file was given as input that was modeled as a frame sequence, and a DNN was trained to provide a sequence of encoded music notes.…”
Section: Jazzmentioning
confidence: 99%
“…transforming an acoustic signal into a symbolic representation, which comprises notes, their pitches, timings, and a classification of the instruments used. AMT [1] [2] is the process of automatically converting a musical sound signal into its representation as musical notation, through digital analysis of the musical signal". The AMT was the objective of many researchers from the time of its establishment, and currently, it has covered a wider range of subtasks [3] [4].…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we introduce the Quartets dataset. Quartets is a well-known collection employed in the Audio to Score field [30,3]. As the dataset provides the Humdrum **kern transcriptions from the excerpts of music, we produced a single-system transcription version of it.…”
Section: Corporamentioning
confidence: 99%
“…Concerning the recognition architectures, we consider a Convolutional Recurrent Neural Network (CRNN) scheme to approximate g (•). Recent works have applied this approach to both OMR [5,6] and AMT [18,19] transcription systems with remarkably successful results. Hence, we shall resort to these works to define our baseline single-modality transcription architectures within the multimodal framework.…”
Section: Neural End-to-end Base Recognition Systemsmentioning
confidence: 99%