Thomas Prätzlich scite author profile

Bittner

2015

When recording a live musical performance, the different voices, such as the instrument groups or soloists of an orchestra, are typically recorded in the same room simultaneously, with at least one microphone assigned to each voice. However, it is difficult to acoustically shield the microphones. In practice, each microphone signal contains interference from every other voice. In this paper, we aim to reduce these interferences in multi-channel recordings to recover only the isolated voices. Following the recently proposed Kernel Additive Modeling framework, we present a method that iteratively estimates both the power spectral density of each voice and the corresponding strength in each microphone signal. With this information, we build an optimal Wiener filter, strongly reducing interferences. The trade-off between distortion and separation can be controlled by the user through the number of iterations of the algorithm. Furthermore, we present a computationally efficient approximation of the iterative procedure. Listening tests demonstrate the effectiveness of the method.

Memory-restricted multiscale dynamic time warping

Driedger

2016

Score-informed audio decomposition and applications

Driedger

Grohganz

et al. 2013

The separation of different sound sources from polyphonic music recordings constitutes a complex task since one has to account for different musical and acoustical aspects. In recent years, various score-informed procedures have been suggested where musical cues such as pitch, timing, and track information are used to support the source separation process. In this paper, we discuss a framework for decomposing a given music recording into notewise audio events which serve as elementary building blocks. In particular, we introduce an interface that employs the additional score information to provide a natural way for a user to interact with these audio events. By simply selecting arbitrary note groups within the score, a user can access, modify, or analyze corresponding events in a given audio recording. In this way, our framework not only opens up new ways for audio editing applications, but also serves as a valuable tool for evaluating and better understanding the results of source separation algorithms.

Triple-based analysis of music alignments without the need of ground-truth annotations

2016

Freischütz digital: A multimodal scenario for informed music processing

Bohl

et al. 2013

In the last decade there has been an explosion in the availability of digitized music material, which comprises data of various formats and modalities including textual, symbolic, acoustic and visual representations. For example, in the case of an opera there typically exist digitized versions of the libretto, different editions of the musical score, as well as a large number of performances given as audio and video recordings. In this paper, we give an overview of various informed approaches to music processing, where the availability of multiple sources of music-related information is used for supporting and improving the analysis of music data. Considering the scenario of the opera "Der Freischütz" by Carl Maria von Weber (a work of central musical importance, where one can draw upon a rich body of sources), we highlight how the identification and creation of crossmodal relationships are a key issue in multimedia processing.

Das Gesamtkunstwerk Oper aus Datensicht

et al. 2015

Known-Artist Live Song Identification Using Audio Hashprints

Tsai

2017

IEEE Trans. Multimedia

Sync Toolbox: A Python Package for Efficient, Robust, and Accurate Music Synchronization

Müller

Özer

Krause

et al. 2021

JOSS