2020
DOI: 10.1007/978-3-030-43887-6_38
|View full text |Cite
|
Sign up to set email alerts
|

Rhythm, Chord and Melody Generation for Lead Sheets Using Recurrent Neural Networks

Abstract: Music that is generated by recurrent neural networks often lacks a sense of direction and coherence. We therefore propose a two-stage LSTM-based model for lead sheet generation, in which the harmonic and rhythmic templates of the song are produced first, after which, in a second stage, a sequence of melody notes is generated conditioned on these templates. A subjective listening test shows that our approach outperforms the baselines and increases perceived musical coherence.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(10 citation statements)
references
References 5 publications
(6 reference statements)
0
10
0
Order By: Relevance
“…1) LSTM-based encoder-decoder: Inspired by [11], we implemented a 3-layer Bidirectional LSTM (BiL-STM) [34] Encoder, consisting of 512 hidden units and a 3-layer Decoder of 1,024 hidden units with 30% dropout between consecutive layers. The BiLSTM states provided by the Encoder allow the lead sheet generator (i.e., Decoder) to look back as well as ahead at the sequence of music parameters defined by the user.…”
Section: Model Architecturesmentioning
confidence: 99%
See 3 more Smart Citations
“…1) LSTM-based encoder-decoder: Inspired by [11], we implemented a 3-layer Bidirectional LSTM (BiL-STM) [34] Encoder, consisting of 512 hidden units and a 3-layer Decoder of 1,024 hidden units with 30% dropout between consecutive layers. The BiLSTM states provided by the Encoder allow the lead sheet generator (i.e., Decoder) to look back as well as ahead at the sequence of music parameters defined by the user.…”
Section: Model Architecturesmentioning
confidence: 99%
“…In addition, there was a limit of up to 32 bars length for every song. • Inspired by [11], we eliminated polyphonic melody parts and ignored ties between notes from different bars. Moreover, we unfolded repetitions since lead sheets can contain repeated phrases.…”
Section: A Data Collection and Pre-processingmentioning
confidence: 99%
See 2 more Smart Citations
“…The pitch sequence contains the pitch range and a rest; the duration sequence contains all types of note duration in music; the root note of a chord is marked with a single note name in the chord sequence, and the type of chord is represented by a 12-dimensional binary vector; the bar position represents the relative position of the note in a bar, and its value is related to the beat resolution. Similarly, [109] represents a lead sheet as three sequences of pitch, rhythm and chord, and stipulates that only one note is played at each time step; [110] represents a bar as two sequences with equal length, where the pitch sequence contains all the pitches and uses "•" as padding, and the rhythm sequence replaces all pitches with the symbol "O", and uses "_" to depict the continuation of a note.…”
Section: ) Sequence-basedmentioning
confidence: 99%