2010
DOI: 10.1109/tasl.2009.2036813
|View full text |Cite
|
Sign up to set email alerts
|

Non-Negative Multilinear Principal Component Analysis of Auditory Temporal Modulations for Music Genre Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
59
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 95 publications
(59 citation statements)
references
References 32 publications
0
59
0
Order By: Relevance
“…STFT involves extracting several frames from the signals and analyzing them using a time-sliding window such that the relation between the variation of the frequency and the time can be identified. We used STFT to transform the original signals into a spatial-spectraltemporal domain as high-dimensional third-order tensors [17]. Given a signal that varies over time, STFT was used to determine the sinusoidal frequency and phase the content of the local sections.…”
Section: Short-time Fourier Transformmentioning
confidence: 99%
“…STFT involves extracting several frames from the signals and analyzing them using a time-sliding window such that the relation between the variation of the frequency and the time can be identified. We used STFT to transform the original signals into a spatial-spectraltemporal domain as high-dimensional third-order tensors [17]. Given a signal that varies over time, STFT was used to determine the sinusoidal frequency and phase the content of the local sections.…”
Section: Short-time Fourier Transformmentioning
confidence: 99%
“…This point of view, however, is not evident in much of the MGR literature, e.g., the three reviews devoted specifically to MGR (Aucouturier and Pachet 2003;Scaringella et al 2006;Fu et al 2011), the work of Tzanetakis and Cook (2002), Barbedo and Lopes (2008), Bergstra et al (2006a), Holzapfel and Stylianou (2008), Marques et al (2011b), Panagakis et al (2010a), Benetos and Kotropoulos (2010), and so on. It is thus not idiosyncratic to claim that one purpose of MGR could be to identify, discriminate between, and learn the criteria of music genres in order to produce genre labels that are indistinguishable from those humans would produce.…”
Section: Argumentsmentioning
confidence: 99%
“…Although much more elaborated music representations have been proposed in the literature, the just mentioned features perform quite well in practice [14,[22][23][24]. Most importantly, song-level representations are suitable for large-scale music classification problems since the space complexity for audio processing and analysis is reduced and the database overflow is prevented [3].…”
Section: Audio Feature Extractionmentioning
confidence: 99%
“…First, to be able to compare the performance of the LRSMs with that of the state-of-theart music classification methods, standard evaluation protocols were applied to the seven datasets. In particular, following [16,17,20,22,56,57], stratified 10-fold crossvalidation was applied to the GTZAN dataset. According to [15,16,54], the same protocol was also applied to the Homburg, Unique, 1517-Artists, and MTV datasets.…”
Section: Datasets and Evaluation Proceduresmentioning
confidence: 99%
See 1 more Smart Citation