2022
DOI: 10.3233/sw-210446
|View full text |Cite
|
Sign up to set email alerts
|

MIDI2vec: Learning MIDI embeddings for reliable prediction of symbolic music metadata

Abstract: An important problem in large symbolic music collections is the low availability of high-quality metadata, which is essential for various information retrieval tasks. Traditionally, systems have addressed this by relying either on costly human annotations or on rule-based systems at a limited scale. Recently, embedding strategies have been exploited for representing latent factors in graphs of connected nodes. In this work, we propose MIDI2vec, a new approach for representing MIDI files as vectors based on gra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 36 publications
0
2
0
Order By: Relevance
“…In the digitization of intangible cultural heritage, efforts tend to focus on phenomenological digitization, targeting the recording of kinetic or vocal activities [26][27][28][29][30][31]. Cinematographic and 3D motion digitization enable the recording of performing arts in multiple media and formats, exhibiting immersive qualities and interactive experiences [32,33] similar to musical content [34].…”
Section: Datamentioning
confidence: 99%
“…In the digitization of intangible cultural heritage, efforts tend to focus on phenomenological digitization, targeting the recording of kinetic or vocal activities [26][27][28][29][30][31]. Cinematographic and 3D motion digitization enable the recording of performing arts in multiple media and formats, exhibiting immersive qualities and interactive experiences [32,33] similar to musical content [34].…”
Section: Datamentioning
confidence: 99%
“…In the digitization of intangible cultural heritage, efforts tend to focus on phenomenological digitization, targeting the recording of kinetic or vocal activities [26][27][28][29][30][31]. Cinematographic and 3D motion digitization enable the recording of performing arts in multiple media and formats, exhibiting immersive qualities and interactive experiences [32,33] similar to musical content [34].…”
Section: Datamentioning
confidence: 99%
“…For the same task, in Bernardo & Langlois (2008) authors used the pitch levels and durations that describe each track to extract a set of features that are used to train a classifier. Recently, Lisena, Meroño-Peñuela & Troncy (2022) used graph embedding techniques to represent MIDI files as vectors. Basically, MIDI files were represented as a graph and node2vec ( Grover & Leskovec, 2016 ) was then run to generate embeddings using random walks in the graph.…”
Section: Related Workmentioning
confidence: 99%