David Rizo scite author profile

Optical Music Recognition is a field of research that investigates how to computationally decode music notation from images. Despite the efforts made so far, there are hardly any complete solutions to the problem. In this work, we study the use of neural networks that work in an end-to-end manner. This is achieved by using a neural model that combines the capabilities of convolutional neural networks, which work on the input image, and recurrent neural networks, which deal with the sequential nature of the problem. Thanks to the use of the the so-called Connectionist Temporal Classification loss function, these models can be directly trained from input images accompanied by their corresponding transcripts into music symbol sequences. We also present the Printed Images of Music Staves (PrIMuS) dataset, containing more than 80,000 monodic single-staff real scores in common western notation, that is used to train and evaluate the neural approach. In our experiments, it is demonstrated that this formulation can be carried out successfully. Additionally, we study several considerations about the codification of the output musical sequences, the convergence and scalability of the neural models, as well as the ability of this approach to locate symbols in the input score.

show abstract

Genre classification using chords and stochastic language models

Pérez-Sancho

Rizo

Iñesta

2009

Connection Science

View full text Add to dashboard Cite

Music genre meta-data is of paramount importance for the organization of music repositories. People use genre in a natural way when entering a music store or looking into music collections. Automatic genre classification has become a popular topic in music information retrieval research both with digital audio and symbolic data. This work focuses on the symbolic approach, bringing to music cognition some technologies, like the stochastic language models, already successfully applied to text categorization. The representation chosen here is to model chord progressions as n-grams and strings and then apply perplexity and Naive Bayes classifiers in order to model how often those structures are found in the target genres. Some genres and sub-genres among popular, jazz, and academic music have been considered and the results at different levels of the genre hierarchy for the techniques employed are presented and discussed.

show abstract

Melody Recognition with Learned Edit Distances

Habrard

Iñesta

Rizo

et al. 2008

View full text Add to dashboard Cite

Abstract. In a music recognition task, the classification of a new melody is often achieved by looking for the closest piece in a set of already known prototypes. The definition of a relevant similarity measure becomes then a crucial point. So far, the edit distance approach with a-priori fixed operation costs has been one of the most used to accomplish the task. In this paper, the application of a probabilistic learning model to both string and tree edit distances is proposed and is compared to a genetic algorithm cost fitting approach. The results show that both learning models outperform fixed-costs systems, and that the probabilistic approach is able to describe consistently the underlying melodic similarity model.

show abstract

Genre classification of music by tonal harmony

Pérez-Sancho

Rizo

Iñesta

et al. 2010

IDA

View full text Add to dashboard Cite

Tree-Structured Representation of Musical Information

Rizo

Iñesta

Moreno‐Seco

2003

View full text Add to dashboard Cite

Stochastic Text Models for Music Categorization

Pérez-Sancho

Rizo

Iñesta

2008

View full text Add to dashboard Cite

Abstract. Music genre meta-data is of paramount importance for the organization of music repositories. People use genre in a natural way when entering a music store or looking into music collections. Automatic genre classification has become a popular topic in music information retrieval research. This work brings to symbolic music recognition some technologies, like the stochastic language models, already successfully applied to text categorization. In this work we model chord progressions and melodies as n-grams and strings and then apply perplexity and naïve Bayes classifiers, respectively, in order to assess how often those structures are found in the target genres. Also a combination of the different techniques as an ensemble of classifiers is proposed. Some genres and sub-genres among popular, jazz, and academic music have been considered. The results show that the ensemble is a good trade-off approach able to perform well without the risk of choosing the wrong classifier.

show abstract

Data-based melody generation through multi-objective evolutionary computation

León

Iñesta

Calvo-Zaragoza

et al. 2016

Journal of Mathematics and Music

View full text Add to dashboard Cite

Genetic-based composition algorithms are able to explore an immense space of possibilities, but the main difficulty has always been the implementation of the selection process. In this work, sets of melodies are utilized for training a machine learning approach to compute fitness, based on different metrics. The fitness of a candidate is provided by combining the metrics, but their values can range through different orders of magnitude and evolve in different ways, which makes it hard to combine these criteria. In order to solve this problem, a multi-objective fitness approach is proposed, in which the best individuals are those in the Pareto-optimal frontier of the multi-dimensional fitness space. Melodic trees are also proposed as a data structure for chromosomic representation of melodies and genetic operators are adapted to them. Some experiments have been carried out using a graphical interface prototype that allows one to explore the creative capabilities of the proposed system. An Online Supplement is provided where the reader can find some technical details, information about the data used, generated melodies, and additional information about the developed prototype and its performance.

show abstract

Data-based melody generation through multi-objective evolutionary computation

León¹,

Iñesta²,

Calvo-Zaragoza³

et al. 2018

View full text Add to dashboard Cite

show abstract

12 3 4

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

David Rizo

End-to-End Neural Optical Music Recognition of Monophonic Scores

Genre classification using chords and stochastic language models

Melody Recognition with Learned Edit Distances

Genre classification of music by tonal harmony

Tree-Structured Representation of Musical Information

Stochastic Text Models for Music Categorization

Data-based melody generation through multi-objective evolutionary computation

Data-based melody generation through multi-objective evolutionary computation

Contact Info

Product

Resources

About