A Mid-Level Representation for Melody-Based Retrieval in Audio Collections

Marolt, Matija

doi:10.1109/tmm.2008.2007293

Cited by 46 publications

(42 citation statements)

References 13 publications


“…Melody is a salient musical descriptor of a piece of music [73] and, therefore, several cover song identification systems use melody representations as a main descriptor [49,50,68,78,79]. As a first processing step, these systems need to extract the…”

Section: Feature Extraction (mentioning)

confidence: 99%

“…To refine the obtained representation, cover detection systems usually need to combine a melody extractor with a voice/non-voice detector and other post-processing modules in order to achieve a more reliable representation [68,78,79]. Another possibility is to generate a so-called "mid-level" representation for these melodies [49,50], where the emphasis is not only put on melody extraction, but also on the feasibility to describe audio in a way that facilitates retrieval.…”

Section: Feature Extraction (mentioning)

confidence: 99%

“…The most straightforward way to achieve key invariance is to test all possible feature transpositions [21,23,36,38,39,41,50,55]. In the case of an octave-independent representation, this implies the computation of a similarity measure for all possible circular (or ring) shifts in the pitch axis for each song.…”

Section: Key Invariance (mentioning)

confidence: 99%
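
Illustrative note: the transposition test described in the statement above can be sketched in a few lines. This is a minimal sketch under stated assumptions, not the method of any cited system: the 12-bin pitch-class vectors, the dot-product similarity, and the helper names `circular_shift`, `dot`, and `best_transposition` are all hypothetical choices made for illustration.

```python
def circular_shift(vec, k):
    """Rotate a pitch-class vector to the right by k bins (assumed: 1 bin = 1 semitone)."""
    k %= len(vec)
    return vec[-k:] + vec[:-k] if k else list(vec)


def dot(a, b):
    """Plain dot product, used here as an assumed similarity measure."""
    return sum(x * y for x, y in zip(a, b))


def best_transposition(query, reference):
    """Try every circular shift of `query` and return (shift, score) of the best match."""
    scores = [(dot(circular_shift(query, k), reference), k)
              for k in range(len(query))]
    score, shift = max(scores)
    return shift, score


# Toy octave-independent profile with energy on C, E, and G (bins 0, 4, 7).
reference = [1.0, 0.0, 0.0, 0.0, 0.8, 0.0, 0.0, 0.9, 0.0, 0.0, 0.0, 0.0]
# The same profile transposed up by two semitones.
query = circular_shift(reference, 2)

shift, score = best_transposition(query, reference)
print(shift, score)
```

In this toy case the best shift rotates the query by the 10 remaining semitones, which undoes the 2-semitone transposition. The cost is one similarity computation per candidate shift for each pair of songs.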

“…A very interesting approach to achieve transposition invariance is to use a 2D power spectrum [50] or a 2D autocorrelation function [37]. Autocorrelation is a well-known operator for converting signals into a delay or shift-invariant representation [58].…”

Section: Key Invariance (mentioning)

confidence: 99%
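
Illustrative note: the shift-invariance property that the statement above relies on can be checked numerically. The sketch below assumes a tiny pitch-by-time matrix and a naive 2D discrete Fourier transform written with the standard library only; `power_spectrum_2d` and `roll_rows` are hypothetical helper names, and nothing here reproduces the implementation of the cited works. The invariance shown holds for circular shifts, which is the case for an octave-folded pitch axis.

```python
import cmath


def power_spectrum_2d(m):
    """Squared magnitude of the naive 2D DFT of a small real-valued matrix."""
    rows, cols = len(m), len(m[0])
    out = []
    for u in range(rows):
        out_row = []
        for v in range(cols):
            acc = 0j
            for r in range(rows):
                for c in range(cols):
                    angle = -2 * cmath.pi * (u * r / rows + v * c / cols)
                    acc += m[r][c] * cmath.exp(1j * angle)
            out_row.append(abs(acc) ** 2)
        out.append(out_row)
    return out


def roll_rows(m, k):
    """Circularly shift the rows (assumed to be the pitch axis) by k positions."""
    k %= len(m)
    return m[-k:] + m[:-k] if k else [row[:] for row in m]


# Toy 4 (pitch bins) x 3 (time frames) patch; the values are arbitrary.
patch = [
    [1.0, 0.0, 0.5],
    [0.0, 2.0, 0.0],
    [0.3, 0.0, 0.0],
    [0.0, 0.0, 1.5],
]
shifted = roll_rows(patch, 1)  # "transposed" by one pitch bin

ps_a = power_spectrum_2d(patch)
ps_b = power_spectrum_2d(shifted)

# A circular shift only changes the phase of each DFT coefficient,
# so the two power spectra agree up to floating-point error.
max_diff = max(abs(a - b) for ra, rb in zip(ps_a, ps_b) for a, b in zip(ra, rb))
print(max_diff)
```

The trade-off is that discarding phase also discards some information about the patch, so two different patches can share a power spectrum.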

“…This can be done independently of the descriptor used. Some cover song identification systems based on a PCP [23,55] or a melodic [49,50] representation use this strategy, and extensions with chords or other types of information could be easily devised. If the beat does not provide enough temporal resolution, a finer representation (e.g.…”

Section: Tempo Invariance (mentioning)

confidence: 99%


Audio Cover Song Identification and Similarity: Background, Approaches, Evaluation, and Beyond

Serrà

Herrera

2010

Studies in Computational Intelligence

101


Communication-Free Widened Learning of Bayesian Network Classifiers Using Hashed Fiedler Vectors

Sampson

Borgelt

Berthold

2018

Advances in Intelligent Data Analysis XVII


Widening is a method where parallel resources are used to find better solutions from greedy algorithms instead of merely trying to find the same solutions more quickly. To date, every example of Widening has used some form of communication between the parallel workers to maintain their distances from one another in the model space. For the first time, we present a communication-free, widened extension to a standard machine learning algorithm. By using Locality Sensitive Hashing on the Bayesian networks' Fiedler vectors, we demonstrate the ability to learn classifiers superior to those of standard implementations and to those generated with a greedy heuristic alone.


Content-Based Audio Retrieval

Müller

2015

Fundamentals of Music Processing
