Basic filters for convolutional neural networks applied to music: Training or design?

Dörfler, Monika; Grill, Thomas; Bammer, Roswitha; Flexer, Arthur

doi:10.1007/s00521-018-3704-x

Cited by 21 publications

(12 citation statements)

References 28 publications

“…Remark For Gabor multipliers c (ψ ⊗ ψ), Propositions 5.4 and 5.5 were proved in [14, Lem. 14], and have been used in the theory of convolutional neural networks [13].…”

Section: The Case C ∈ ∞ (3)

Citation type: mentioning

confidence: 99%

Quantum Harmonic Analysis on Lattices and Gabor Multipliers

Skrettingland

2020

J Fourier Anal Appl

We develop a theory of quantum harmonic analysis on lattices in R^{2d}. Convolutions of a sequence with an operator and of two operators are defined over a lattice, and using corresponding Fourier transforms of sequences and operators we develop a version of harmonic analysis for these objects. We prove analogues of results from classical harmonic analysis and the quantum harmonic analysis of Werner, including Tauberian theorems and a Wiener division lemma. Gabor multipliers from time-frequency analysis are described as convolutions in this setting. The quantum harmonic analysis is thus a conceptual framework for the study of Gabor multipliers, and several of the results include results on Gabor multipliers as special cases.

Keywords: Gabor multipliers • Tauberian theorems • Feichtinger's algebra • Fourier-Wigner transform

Communicated by Hans G. Feichtinger.

“…Related principles of dimension reduction for other clinical classification problems in OCT have already been successfully applied in [9]. In the second experiment we aim to categorize musical instruments based on their spectrogram, see [18] for related results. Our utilized augmented target loss functions can increase the accuracy in both experiments.…”

Section: Introduction

Citation type: mentioning

confidence: 99%

On Orthogonal Projections for Dimension Reduction and Applications in Augmented Target Loss Functions for Learning Problems

et al. 2019

The use of orthogonal projections on high-dimensional input and target data in learning frameworks is studied. First, we investigate the relations between two standard objectives in dimension reduction, preservation of variance and of pairwise relative distances. Investigations of their asymptotic correlation as well as numerical experiments show that a projection usually does not satisfy both objectives at once. In a standard classification problem we determine projections on the input data that balance the objectives and compare subsequent results. Next, we extend our application of orthogonal projections to deep learning tasks and introduce a general framework of augmented target loss functions. These loss functions integrate additional information via transformations and projections of the target data. In two supervised learning problems, clinical image segmentation and music information classification, the application of our proposed augmented target loss functions increases the accuracy.

“…The representation generalizes the first layer of the scattering transform [3]. Independently, a related transform was developed by Dörfler et al. [4], which is equivalent to the mel spectrogram and therefore not frequency-uniform and pitch-invariant at the same time.…”

Section: Analysis Of Existing Representations

Citation type: mentioning

confidence: 99%

A Frequency‐Uniform and Pitch‐Invariant Time‐Frequency Representation

Schulze

King

2019

Proc Appl Math and Mech

We introduce the terms frequency-uniformity and pitch-invariance in order to characterize time-frequency representations. A frequency-uniform representation has the property that it displays Dirac transients as a straight line in the spectrogram, while a pitch-invariant representation translates pitch change into shifts, which is adequate for melodic instruments. We propose a novel representation that fulfills both criteria.
