2008
DOI: 10.1109/tmm.2008.2007293

A Mid-Level Representation for Melody-Based Retrieval in Audio Collections

Abstract: Searching audio collections using high-level musical descriptors is a difficult problem, due to the lack of reliable methods for extracting melody, harmony, rhythm, and other such descriptors from unstructured audio signals. In this paper, we present a novel approach to melody-based retrieval in audio collections. Our approach supports audio as well as symbolic queries, and ranks results according to melodic similarity to the query. We introduce a beat-synchronous melodic representation consisting of salient me…

Citations: cited by 46 publications (42 citation statements)
References: 13 publications
“…Melody is a salient musical descriptor of a piece of music [73] and, therefore, several cover song identification systems use melody representations as a main descriptor [49,50,68,78,79]. As a first processing step, these systems need to extract the…”
Section: Feature Extraction
Citation type: mentioning
confidence: 99%
“…To refine the obtained representation, cover detection systems usually need to combine a melody extractor with a voice/non-voice detector and other post-processing modules in order to achieve a more reliable representation [68,78,79]. Another possibility is to generate a so-called "mid-level" representation for these melodies [49,50], where the emphasis is not only put on melody extraction, but also on the feasibility to describe audio in a way that facilitates retrieval.…”
Section: Feature Extraction
Citation type: mentioning
confidence: 99%