“…Since Gatys et al proposed the neural algorithm for image style transfer [11], deep learning-based style transfer has been extensively studied. Various types of neural network models now can modify the texture of an image [13,19,30], the genre or instrument of a music piece [20,21], and the sentiment of texts [29], with all their content information being preserved. Despite its success, one notable issue in these style transfer methods is that almost all of them operate merely within one single data modality, e.g., from one image to another image, or from one music piece to another.…”