End-to-End Neural Optical Music Recognition of Monophonic Scores

Calvo-Zaragoza, Jorge; Rizo, David

doi:10.3390/app8040606

Cited by 62 publications

(83 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We use the Symbol Error Rate (SER) [17,18,19] metric. Similarly to Word Error Rate (WER) [28], commonly used in text recognition community, SER is computed as the Levenshtein distance: the sum of edit operations that are needed to convert the output of our method into the groundtruth in terms of symbol insertions (I), substitutions (S) and deletions (D).…”

Section: Discussionmentioning

confidence: 99%

“…For example, Van der Wel et al [17] use Convolutional Neural Networks (CNNs) and sequenceto-sequence (seq2seq) models for recognizing monophonic printed music scores. Calvo-Zaragoza et al [18,19] also use a CNN to extract features from printed music scores and feed a Recurrent Neural Network. To avoid the alignment between the music score and the ground-truth data, they use the Connectionist Temporal Classification (CTC) loss function commonly used in speech and text recognition.…”

Section: Deep Learning-based Approachesmentioning

confidence: 99%

“…Printed dataset: we use a subset of PrIMuS dataset [19], which consists of rendered incipts from the RISM 4 . It is annotated at primitive level i.e.…”

Section: Datasetsmentioning

confidence: 99%

See 2 more Smart Citations

From Optical Music Recognition to Handwritten Music Recognition: A baseline

Baró

Riba

Calvo-Zaragoza

et al. 2019

Pattern Recognition Letters

Self Cite

View full text Add to dashboard Cite

Optical Music Recognition (OMR) is the branch of document image analysis that aims to convert images of musical scores into a computer-readable format. Despite decades of research, the recognition of handwritten music scores, concretely the Western notation, is still an open problem, and the few existing works only focus on a specific stage of OMR. In this work, we propose a full Handwritten Music Recognition (HMR) system based on Convolutional Recurrent Neural Networks, data augmentation and transfer learning, that can serve as a baseline for the research community.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Deep Learning-based Approachesmentioning

confidence: 99%

See 1 more Smart Citation

From Optical Music Recognition to Handwritten Music Recognition: A baseline

Baró

Riba

Calvo-Zaragoza

et al. 2019

Pattern Recognition Letters

Self Cite

View full text Add to dashboard Cite

show abstract

“…The authors of [3], following the work of [17], observe that it does not make sense to apply directly the standard Unix diff utility to XML score files. A possible solution is to extract a linear representation of the graphical content [6], but motivating by the hierarchical structure of note beaming and tuplet grouping we chose to follow another approach and compare scores in terms of hierarchical structure, by using a tree-edit distance based on tree nodes operations, as proposed by [27] or [7].…”

Section: Figurementioning

confidence: 99%

“…At a detailed level, it is very valuable for musicologist and developers of version control systems to get precise clues on the locations of the differences between scores (e.g., between two editions of the same score). One difficulty that immediately arises for defining a diff tool for music scores is that, due to the nature/complexity of the music language, a music score contains multiple levels [6,10] that can be compared.…”

Section: Introductionmentioning

confidence: 99%