2018
DOI: 10.2478/ausi-2018-0009

Connecting the Last.fm Dataset to LyricWiki and MusicBrainz. Lyrics-based experiments in genre classification

Abstract: Music information retrieval has become an important field of information retrieval, because in-depth analysis of music pieces can yield important information: genre labels, mood predictions, and artist identification, to name a few. The lack of large-scale music datasets containing audio features and metadata has led to the construction and publication of the Million Song Dataset (MSD) and its satellite datasets. Nonetheless, mainly because of licensing limitations, no freely available lyrics…

Cited by 3 publications (3 citation statements)
References 26 publications

“…In a similar direction, Bodó and Szilágyi generated a dataset for lyrics genre classification by combining Last.fm tags with MusicBrainz data [4]. MusicBrainz is an online database of music editorial metadata 1 .…”
Section: Lastfm Tags (mentioning)
confidence: 99%
“…Another publicly available dataset is the Spotify Audio Features Kaggle dataset 4 . This dataset contains more than 116,000 unique tracks, and includes audio features for each track.…”
Section: Spotify Audio Features (mentioning)
confidence: 99%
“…It collected the data of seven well-known authoritative foreign music communities, sorted out and analyzed the data, and provided researchers with offline datasets and analysis results obtained by various algorithms. The offline dataset given by Last.fm [32] is mostly used for the optimization method data in this subsection. The offline dataset given by Last.fm is separated into a training set and a test set, with the training set accounting for 80% of the dataset and the test set accounting for 20%.…”
Section: Directed Tag-based Collaborative Filtering Algorithm In Hete... (mentioning)
confidence: 99%