Proceedings of the 16th ACM International Conference on Multimedia 2008
DOI: 10.1145/1459359.1459382
|View full text |Cite
|
Sign up to set email alerts
|

Combination of audio and lyrics features for genre classification in digital audio collections

Abstract: In many areas multimedia technology has made its way into mainstream. In the case of digital audio this is manifested in numerous online music stores having turned into profitable businesses. The widespread user adaption of digital audio both on home computers and mobile players show the size of this market. Thus, ways to automatically process and handle the growing size of private and commercial collections become increasingly important; along goes a need to make music interpretable by computers. The most obv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
31
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 52 publications
(38 citation statements)
references
References 14 publications
(14 reference statements)
0
31
0
Order By: Relevance
“…By combining SSD and Symbolic features they achieved an accuracy of 74.5%, an improvement of 3.1% in the classification accuracy. On a different database, but compatible in size with the LMD, [16] has achieved an accuracy of 66.32% with SSD features alone and an accuracy of 68.72%, an improvement of 2.40%, when combining SSD with features extracted from the song Lyrics. If we compare the best results obtained in the ISMIR genre database, we will see that by using only SSD we have an accuracy of 76.12%.…”
Section: Discussion and Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…By combining SSD and Symbolic features they achieved an accuracy of 74.5%, an improvement of 3.1% in the classification accuracy. On a different database, but compatible in size with the LMD, [16] has achieved an accuracy of 66.32% with SSD features alone and an accuracy of 68.72%, an improvement of 2.40%, when combining SSD with features extracted from the song Lyrics. If we compare the best results obtained in the ISMIR genre database, we will see that by using only SSD we have an accuracy of 76.12%.…”
Section: Discussion and Related Workmentioning
confidence: 99%
“…The additional sources of information that has been used to augment the content-based approaches are: Cultural information by using web-mining techniques [27]; Boolean meta-data tags representing the music context [1]; Lyrics [16]; Symbolic representation by using a transcription system [15]; and the combination of cultural information and symbolic representation features [18]. In all these approaches the use of additional sources of information has improved the classification accuracy when compared to using only the audio content-based features.…”
Section: Discussion and Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…There are many novel sets of style features for automatic lyrics processing. Rudolf Mayer [11] has presented features to capture rhyme, parts-of-speech, and text statistics characteristics for song lyrics. He has combined these new feature sets with the standard bag-of-words features and wellknown feature sets for acoustic analysis of digital audio tracks.…”
Section: Related Workmentioning
confidence: 99%