In this paper, we propose a musical feature extracted from the bitstream of AAC (Advanced Audio Coding) compressed audio data without decoding to audio signals. We focus on the spectral data which are stored in the bitstream for representing the flatten MDCT (Modified Discrete Cosine Transform) of an audio signal. For computing the musical feature, we extract the spectral data and apply the Discrete Wavelet Transform (DWT) to the extracted spectral data. For musical genre classification, we use the discriminant analysis as a classifier. We experimented on 1, 498 AAC compressed audio data collected from 10 musical genres and evaluated the performance of the musical feature. We got the maximum correct ratios 81.24%. The experiments showed that the musical feature based on the spectral data in the bitstream had good performance for genre classification in the MPEG-4 AAC compressed domain.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.