Combination of audio and lyrics features for genre classification in digital audio collections

Mayer, Rudolf; Neumayer, Robert; Rauber, Andreas

doi:10.1145/1459359.1459382

Cited by 52 publications

(38 citation statements)

References 14 publications

(14 reference statements)

“…By combining SSD and Symbolic features they achieved an accuracy of 74.5%, an improvement of 3.1% in the classification accuracy. On a different database, but compatible in size with the LMD, [16] has achieved an accuracy of 66.32% with SSD features alone and an accuracy of 68.72%, an improvement of 2.40%, when combining SSD with features extracted from the song Lyrics. If we compare the best results obtained in the ISMIR genre database, we will see that by using only SSD we have an accuracy of 76.12%.…”

Section: Discussion and Related Work (mentioning)

confidence: 99%

“…The additional sources of information that has been used to augment the content-based approaches are: Cultural information by using web-mining techniques [27]; Boolean meta-data tags representing the music context [1]; Lyrics [16]; Symbolic representation by using a transcription system [15]; and the combination of cultural information and symbolic representation features [18]. In all these approaches the use of additional sources of information has improved the classification accuracy when compared to using only the audio content-based features.…”

Section: Discussion and Related Work (mentioning)

confidence: 99%

“…This is due to the fact that the music genre in one of the most frequent items used in search queries, as pointed by several authors [5,13,21]. Over the past decade this task has received growing attention every year which lead to the development of different types of feature descriptors [9,14,26] and classifications approaches [1,15,16,18,24]. Current research on automatic music genre classification suggests that the use of content-based approaches have reached a glass ceiling [15].…”

Section: Introduction (mentioning)

confidence: 99%

“…Current research on automatic music genre classification suggests that the use of content-based approaches have reached a glass ceiling [15]. In order to overcome this situation, several authors have proposed the use of different types of information related to the music pieces in combination with the content-based features [1,15,16,18,27].…”

Section: Introduction (mentioning)

confidence: 99%

Improving automatic music genre classification with hybrid content-based feature vectors

Silla

Koerich

Kaestner

2010

Proceedings of the 2010 ACM Symposium on Applied Computing

Current research on the task of automatic music genre classification has been focusing on new classification approaches based on combining information from other sources than the music signal. The reason for this is that the use of content-based approaches, i.e. using features extracted directly from the audio signal, seems to have reached a glass ceiling. In this work we show that by using different types of content-based features together it is possible to substantially improve the classification accuracy. This is an interesting result as different types of content-based features aim, at a conceptual level, to capture the same type of information. In order to identify which types of content-based features are responsible for the predictive accuracy gain, we also used a feature selection (FS) approach based on a genetic algorithm (GA). The analysis of the results in two databases shows that the use of the GA for FS succeeds in selecting a representative subset without significant loss in accuracy. It also shows that all the different types of content-based features employed are important for the improvement of the accuracy in classifying music genres.

“…There are many novel sets of style features for automatic lyrics processing. Rudolf Mayer [11] has presented features to capture rhyme, parts-of-speech, and text statistics characteristics for song lyrics. He has combined these new feature sets with the standard bag-of-words features and well-known feature sets for acoustic analysis of digital audio tracks.…”

Section: Related Work (mentioning)

confidence: 99%

Mood classifiaction of lyrics using SentiWordNet

Kumar

Minz

2013

2013 International Conference on Computer Communication and Informatics

The text data being unstructured pose multiple research issues in document classification. Relevant feature extraction is the foremost problem in the preprocessing stage. SentiWordNet is an ontology that includes numeric scores related to the positive or negative aspects of the words. The work in this paper explores the use of SentiWordNet to extract sentiment features of the words in the song lyrics. The experiments are carried out on a collection of 185 lyrics each belonging to one of the four classes. Three classification algorithms namely, Naïve Bayesian (NB), k-Nearest Neighbor (KNN) and Support Vector Machine (SVM) using six measures for attribute relevance analysis namely, Principal Component Analysis (PCA), Latent Semantic Analysis (LSA), Chi-Square (CS), Information Gain (IG), GINI Index (GI) and Gain Ratio (GR) have been applied to model the classifiers. The experiments examine the relevance of the sentiment features for classification. The ratio of the positive and negative scores, normalized ratio, and average of the positive and negative scores are three sentiment features. The experimental results indicate that the Naïve Bayesian classifier using the average of the positive and negative score as sentiment feature, and gain ratio as feature selection criteria achieve 78.27% accuracy based on top 10% of the features. The second best accuracy has been achieved by SVM-based classifiers using the average of the positive and negative score as sentiment feature and top 10% features applying all feature selection criteria except CS.

Music subject classification based on lyrics and user interpretations

Choi

Lee

et al.

2016

Proc. Assoc. Info. Sci. Tech.

That music seekers consider song subject metadata to be helpful in their searching/browsing experience has been noted in prior published research. In an effort to develop a subject-based tagging system, we explored the creation of automatically generated song subject classifications. Our classifications were derived from two different sources of song-related text: 1) lyrics; and 2) user interpretations of lyrics collected from songmeanings.com. While both sources contain subject-related information, we found that user-generated interpretations always outperformed lyrics in terms of classification accuracy. This suggests that user interpretations are more useful in the subject classification task than lyrics because the semantically ambiguous poetic nature of lyrics tends to confuse classifiers. An examination of top-ranked terms and confusion matrices supported our contention that users' interpretations work better for detecting the meaning of songs than what is conveyed through lyrics.
