2018
DOI: 10.48550/arxiv.1811.12214
Preprint

Play as You Like: Timbre-enhanced Multi-modal Music Style Transfer

citations
Cited by 1 publication
(3 citation statements)
references
References 18 publications
“…Music styles depend on the semantic domain being discussed, such as timbre, performance, or composition styles. Timbre style transfer is usually audio-to-audio style transfer which aims at modifying the timbre such as instrument [9,20,32] or the gender of singers' voice [17,34]. Performance style transfer can be either audio-to-audio or symbolic-to-audio, the latter such as piano performance rendering [12,22] refers to the tasks of converting deadpan performance data (e.g., MIDI) into expressive performance with a specific interpretation of timing and dynamics.…”
Section: Music Style Transfer (mentioning)
confidence: 99%
“…We also introduce an adversarial loss $L_{adv}$ to enhance the training process. Following [20], we employ RaGAN [14]. Let $P_{real}$ and $P_{gen}$ be the distributions of the image and the music representation, respectively. The adversarial loss is represented as…”
Section: Training the Music Visualization Net (MVNet) (mentioning)
confidence: 99%