2006
DOI: 10.1109/mmul.2006.3
|View full text |Cite
|
Sign up to set email alerts
|

Automatic structure detection for popular music

Abstract: Music structure is very important for semantic music understanding. We propose a novel approach for popular music structure detection. The proposed approach applies beat space segmentation, chord detection, singing voice boundary detection, melody and content based similarity region detection to music structure detection. A frequency scaling "Octave Scale" is used to calculate Cepstral coefficients to represent the music content. The experiments illustrate that the proposed approach achieves better performance… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(18 citation statements)
references
References 16 publications
0
18
0
Order By: Relevance
“…We consider frequencies between 60 Hz and 1000 Hz, which correspond to midi notes from B1 to B5. The upper limit is set to 1 kHz because the fundamentals and harmonics of the music notes in popular music are usually stronger than the non-harmonic components up to 1 kHz [24]. This choice is also supported by the fact that many of the higher harmonics, which are whole number multiples of the fundamental frequency, are far from any note of the Western chromatic scale.…”
Section: ) Parametersmentioning
confidence: 99%
See 1 more Smart Citation
“…We consider frequencies between 60 Hz and 1000 Hz, which correspond to midi notes from B1 to B5. The upper limit is set to 1 kHz because the fundamentals and harmonics of the music notes in popular music are usually stronger than the non-harmonic components up to 1 kHz [24]. This choice is also supported by the fact that many of the higher harmonics, which are whole number multiples of the fundamental frequency, are far from any note of the Western chromatic scale.…”
Section: ) Parametersmentioning
confidence: 99%
“…At each time instant t m ,w ec o m p u t et h ec o s i n ed i s t a n c e s between the observation vector O(t m ) and each of the 24 chord templates CT i ,i ∈ [1,24].…”
Section: Chord Symbol Probabilities Computationmentioning
confidence: 99%
“…It is based on the premise that some chord combinations are more common around structural boundaries, especially when expressed as relative chords in a key, giving a musicologically richer representation that is made possible by the concurrent estimation of keys and chords. In that regard, it is more similar to previous work by Maddage [5] or by Lee [3], only do their systems work sequentially because the chord and key estimates are used as inputs to the structure estimation, whereas our model generates them concurrently.…”
Section: Introductionmentioning
confidence: 81%
“…Additional modeling constraints or auxiliary information can further improve chord labeling accuracy. These include the prior identification of the fundamental frequency or root note of each chord before the chromagram is estimated [31], information about the metrical structure [32], information about the musical structure [33], or the musical context [30]. Current state-of-the-art chord labeling programs from audio have reached an identification accuracy of up to 80% as measured by the time overlap between predicted and ground truth chord labels; see [34].…”
Section: Automatic Chord Labelingmentioning
confidence: 99%