The Ottawa panel clinical practice guidelines for the management of knee osteoarthritis. Part two: strengthening exercise programs

Abstract. Much work is focused upon music genre recognition (MGR) from audio recordings, symbolic data, and other modalities. While reviews have been written of some of this work before, no survey has been made of the approaches to evaluating approaches to MGR. This paper compiles a bibliography of work in MGR, and analyzes three aspects of evaluation: experimental designs, datasets, and figures of merit.

show abstract

A Simple Method to Determine if a Music Information Retrieval System is a “Horse”

Sturm

2014

IEEE Trans. Multimedia

View full text Add to dashboard Cite

We propose and demonstrate a simple method to explain the figure of merit (FoM) of a music information retrieval (MIR) system evaluated in a dataset, specifically, whether the FoM comes from the system using characteristics confounded with the "ground truth" of the dataset. Akin to the controlled experiments designed to test the supposed mathematical ability of the famous horse "Clever Hans," we perform two experiments to show how three state-of-the-art MIR systems produce excellent FoM in spite of not using musical knowledge. This provides avenues for improving MIR systems, as well as their evaluation. We make available a reproducible research package so that others can apply the same method to evaluating other MIR systems.Index Terms-2-WORK system performance, 5-CONT content description and annotation, 5-SEAR multimedia search and retrieval.Bob L. Sturm (S'06-M'09) received the Ph.D. degree in electrical and computer engineering from the

show abstract

Deep Learning and Music Adversaries

Kereliuk

Sturm

Larsen

2015

IEEE Trans. Multimedia

110

View full text Add to dashboard Cite

Abstract-An adversary is essentially an algorithm intent on making a classification system perform in some particular way given an input, e.g., increase the probability of a false negative. Recent work builds adversaries for deep learning systems applied to image object recognition, which exploits the parameters of the system to find the minimal perturbation of the input image such that the network misclassifies it with high confidence. We adapt this approach to construct and deploy an adversary of deep learning systems applied to music content analysis. In our case, however, the input to the systems is magnitude spectral frames, which requires special care in order to produce valid input audio signals from network-derived perturbations. For two different train-test partitionings of two benchmark datasets, and two different deep architectures, we find that this adversary is very effective in defeating the resulting systems. We find the convolutional networks are more robust, however, compared with systems based on a majority vote over individually classified audio frames. Furthermore, we integrate the adversary into the training of new deep systems, but do not find that this improves their resilience against the same adversary.

show abstract

An analysis of the GTZAN music genre dataset

Sturm

2012

View full text Add to dashboard Cite

show abstract

The State of the Art Ten Years After a State of the Art: Future Research in Music Information Retrieval

Sturm

2014

Journal of New Music Research

View full text Add to dashboard Cite

The GTZAN dataset appears in at least 100 published works, and is the most-used public dataset for evaluation in machine listening research for music genre recognition (MGR). Our recent work, however, shows GTZAN has several faults (repetitions, mislabelings, and distortions), which challenge the interpretability of any result derived using it. In this article, we disprove the claims that all MGR systems are affected in the same ways by these faults, and that the performances of MGR systems in GTZAN are still meaningfully comparable since they all face the same faults. We identify and analyze the contents of GTZAN, and provide a catalog of its faults. We review how GTZAN has been used in MGR research, and find few indications that its faults have been known and considered. Finally, we rigorously study the effects of its faults on evaluating five different MGR systems. The lesson is not to banish GTZAN, but to use it with consideration of its contents.

show abstract

Ensemble Models for Spoofing Detection in Automatic Speaker Verification

Chettri

Stoller

Morfi

et al. 2019

View full text Add to dashboard Cite

Spectrograms -time-frequency representations of audio signals -have found widespread use in neural network-based spoofing detection. While deep models are trained on the fullband spectrum of the signal, we argue that not all frequency bands are useful for these tasks. In this paper, we systematically investigate the impact of different subbands and their importance on replay spoofing detection on two benchmark datasets: ASVspoof 2017 v2.0 and ASVspoof 2019 PA. We propose a joint subband modelling framework that employs n different sub-networks to learn subband specific features. These are later combined and passed to a classifier and the whole network weights are updated during training. Our findings on the ASVspoof 2017 dataset suggest that the most discriminative information appears to be in the first and the last 1 kHz frequency bands, and the joint model trained on these two subbands shows the best performance outperforming the baselines by a large margin. However, these findings do not generalise on the ASVspoof 2019 PA dataset. This suggests that the datasets available for training these models do not reflect real world replay conditions suggesting a need for careful design of datasets for training replay spoofing countermeasures.

show abstract

Two systems for automatic music genre recognition

Sturm

2012

View full text Add to dashboard Cite

We re-implement two state-of-the-art systems for music genre recognition, and closely examine their behavior. First, we find specific excerpts each system consistently and persistently mislabels. Second, we test the robustness of each system to spectral adjustments to audio signals. Finally, we expose the internal genre models of each system by testing if human can recognize the genres of music excerpts composed by each system to be highly genre-representative. Our results suggest that, though they have high mean classification accuracies, neither system is recognizing music genre.

show abstract

Machine learning research that matters for music creation: A case study

Sturm

Ben-Tal

Monaghan

et al. 2018

Journal of New Music Research

View full text Add to dashboard Cite

* Corresponding author 1 Research applying machine learning to music modeling and generation typically proposes model architectures, training methods and datasets, and gauges system performance using quantitative measures like sequence likelihoods and/or qualitative listening tests. Rarely does such work explicitly question and analyse its usefulness for and impact on real-world practitioners, and then build on those outcomes to inform the development and application of machine learning. This article attempts to do these things for machine learning applied to music creation. Together with practitioners, we develop and use several applications of machine learning for music creation, and present a public concert of the results. We reflect on the entire experience to arrive at several ways of advancing these and similar applications of machine learning to music creation.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Bob L. Sturm

A Survey of Evaluation in Music Genre Recognition

A Simple Method to Determine if a Music Information Retrieval System is a “Horse”

Deep Learning and Music Adversaries

An analysis of the GTZAN music genre dataset

The State of the Art Ten Years After a State of the Art: Future Research in Music Information Retrieval

Ensemble Models for Spoofing Detection in Automatic Speaker Verification

Two systems for automatic music genre recognition

Machine learning research that matters for music creation: A case study

Contact Info

Product

Resources

About