Proceedings of the 1st International Workshop on Digital Libraries for Musicology 2014
DOI: 10.1145/2660168.2660176
Incremental Dataset Definition for Large Scale Musicological Research

Abstract: Conducting experiments on large scale musical datasets often requires the definition of a dataset as a first step in the analysis process. This is a classification task, but metadata providing the relevant information is not always available or reliable, and manual annotation can be prohibitively expensive. In this study we aim to automate the annotation process using a machine learning approach for classification. We evaluate the effectiveness and the trade-off between accuracy and required number of annotated…
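The abstract describes automating dataset annotation with a classifier and measuring the trade-off between accuracy and the number of manually annotated examples. The sketch below is only an illustration of that general idea, not the paper's own pipeline: it assumes precomputed feature vectors and ground-truth labels, uses scikit-learn's LogisticRegression, and grows the labelled set by uncertainty sampling; the function name and all parameters are hypothetical.

```python
# Hypothetical sketch (not the authors' code): incremental annotation via
# uncertainty sampling, tracking accuracy against annotation cost.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

def incremental_annotation(features, labels, seed_size=50, batch_size=25, rounds=10):
    """Grow a labelled set round by round and record (num_labelled, accuracy)."""
    rng = np.random.default_rng(0)
    labelled = list(rng.choice(len(features), size=seed_size, replace=False))
    unlabelled = [i for i in range(len(features)) if i not in set(labelled)]
    history = []
    for _ in range(rounds):
        clf = LogisticRegression(max_iter=1000)
        clf.fit(features[labelled], labels[labelled])
        # Trade-off curve: accuracy on the not-yet-annotated pool vs. labelled-set size.
        acc = accuracy_score(labels[unlabelled], clf.predict(features[unlabelled]))
        history.append((len(labelled), acc))
        # Query the items the classifier is least confident about for manual annotation.
        confidence = clf.predict_proba(features[unlabelled]).max(axis=1)
        query_idx = np.argsort(confidence)[:batch_size]
        newly_labelled = [unlabelled[i] for i in query_idx]  # would go to a human annotator
        labelled.extend(newly_labelled)
        unlabelled = [i for i in unlabelled if i not in set(newly_labelled)]
    return history
```

In this framing the returned history approximates the accuracy-versus-annotation-effort curve the abstract refers to: each round adds a batch of labels and re-evaluates, so one can see how many annotations are needed before accuracy plateaus.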

Cited by 2 publications (3 citation statements)
References 19 publications
“…The statistics of these datasets are listed in Table 1. The OpenBMAT [6] and ORF TV dataset [44] are collected from TV programs, and AVASpeech [10] comprises audio from YouTube, while Muspeak [14] has a variety of content such as concert, radio broadcast, and low-fidelity folk music.…”
Section: Methods
confidence: 99%
“…Note that % of music/speech is estimated based on the duration labeled as music or speech and the total duration of the audio content means they can be used only for either speech or music detection, but not both. The GTZAN Speech and Music dataset [11], Scheirer & Slaney Music Speech [12], MUSAN [13], and Muspeak [14] datasets contain only short segments and non-overlapping speech or music labels. Thus, these datasets can only be used for a simplified music and speech segmentation task, where the audio segments can only be classified into either speech, music, or noise without any overlap.…”
Section: Introduction
confidence: 99%
“…-Adaptive learning: adaptive learning systems can continuously collect and interpret student data, change the direction and environment of students' learning, taking into account their needs and abilities [16].…”
Section: Big Data and Learning Analytics in General Education Schools
confidence: 99%