2004
DOI: 10.1002/asi.20060
|View full text |Cite
|
Sign up to set email alerts
|

Music analysis and retrieval systems for audio signals

Abstract: The constantly increasing amount of audio available in digital form necessitates the development of software systems for analyzing and retrieving digital audio. In this work, we describe our efforts in developing such systems. More specifically, we describe the design philosophy behind our approach, the specific problems we try to solve, and how we evaluate the performance of our algorithms. Automatic music analysis and retrieval of non-speech digital audio is a relatively new field, and the existing technique… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
55
0

Year Published

2006
2006
2014
2014

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 38 publications
(56 citation statements)
references
References 19 publications
1
55
0
Order By: Relevance
“…Half of these report results from a single split [8,10,12,27,45,52]; but the other half reports a mean of many trials, e.g., 5 [43], 30 [97], and 100 trials [65,81,92,93]. Seven papers [6,21,38,56,59,69,87] do not coherently describe their design parameters.…”
Section: Using Gtzanmentioning
confidence: 99%
See 1 more Smart Citation
“…Half of these report results from a single split [8,10,12,27,45,52]; but the other half reports a mean of many trials, e.g., 5 [43], 30 [97], and 100 trials [65,81,92,93]. Seven papers [6,21,38,56,59,69,87] do not coherently describe their design parameters.…”
Section: Using Gtzanmentioning
confidence: 99%
“…GTZAN is composed of 1,000 half-minute music audio excerpts singly labeled in ten categories [92,93]; and though its use is so widespread, it has always been missing metadata identifying its contents. In fact, GTZAN was not expressly created for MGR, 2 but its availability has made it a benchmark dataset, and thus a measuring stick for comparing MGR systems, e.g., [102].…”
Section: Introductionmentioning
confidence: 99%
“…All features were extracted with the MIRtoolbox 2 ) using a framebased approach (Tzanetakis & Cook, 2002a Harte, Sandler, & Gasser, 2006), the centroid of an uncollapsed Chromagram], the frames were 2 s long, with an overlap of 50%, while for structural features (Repetition), frame length was 100 ms and overlap also at 50%. The results from the frames were then summarized by either the mean, the standard deviation, or the slope function.…”
Section: Feature Extractionmentioning
confidence: 99%
“…This is summarized in Table 1, which provides concrete design guidelines for the use of music characteristics in an AMP. Songs in music databases can be tagged with music characteristics by employing one of the algorithms proposed by [19,26,31,36]. Then, using the guidelines provided, an AMP can be implemented directly.…”
Section: Discussionmentioning
confidence: 99%
“…It is generally accepted that music characteristics can influence affect [13,18,27,33]. Consequently, several algorithms were already introduced for content analysis of music [19,25,31,36]. They are also employed in music systems like last.fm [17]; although, for other purposes than to direct affect.…”
Section: Music Is Known For Its Ability To Change Affect; It Ismentioning
confidence: 99%