2020
DOI: 10.3390/info11040211
|View full text |Cite
|
Sign up to set email alerts
|

Investigation of Spoken-Language Detection and Classification in Broadcasted Audio Content

Abstract: The current paper focuses on the investigation of spoken-language classification in audio broadcasting content. The approach reflects a real-word scenario, encountered in modern media/monitoring organizations, where semi-automated indexing/documentation is deployed, which could be facilitated by the proposed language detection preprocessing. Multilingual audio recordings of specific radio streams are formed into a small dataset, which is used for the adaptive classification experiments, without seeking-at this… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 31 publications
(68 reference statements)
0
8
0
Order By: Relevance
“…How to use free and open-source software from and through the Internet to quickly and easily create sound/audio media spots (see Kotsakis et al 2020;Vryzas et al 2020;Kalliris et al 2019;Matsiola et al 2019;Nicolaou et al 2019;Tsipas, Nikolaos, LazarosVrysis, CharalamposDimoulas, and George Papanikolaou, 2015;Kotsakis et al 2012a;Kotsakis et al 2012b;Aguayo Gonzalez et al 2009;Salmon et al 2008;Dimoulas et al 2000).…”
Section: Lesson Planmentioning
confidence: 99%
“…How to use free and open-source software from and through the Internet to quickly and easily create sound/audio media spots (see Kotsakis et al 2020;Vryzas et al 2020;Kalliris et al 2019;Matsiola et al 2019;Nicolaou et al 2019;Tsipas, Nikolaos, LazarosVrysis, CharalamposDimoulas, and George Papanikolaou, 2015;Kotsakis et al 2012a;Kotsakis et al 2012b;Aguayo Gonzalez et al 2009;Salmon et al 2008;Dimoulas et al 2000).…”
Section: Lesson Planmentioning
confidence: 99%
“…Motivated by the results of a previous research on program-adaptive pattern analysis for Voice/Music/Phone [1] and Language Discrimination taxonomies [6], the presented methodology functions as an add-on module towards the formulation of a dynamic Generic Audio Classification Repository. Hence, following already adopted hierarchical classification strategies, new schemes were adapted based on clustering techniques, but also their combination with supervised training methods.…”
Section: Background Work and Problem Definitionmentioning
confidence: 99%
“…However, there are issues regarding the inhomogeneity of labeling meta-data, while in some cases, ground-truth training pairs are difficult to obtain (or are even completely unavailable). Hence, combinations of supervised, semi-supervised and unsupervised data mining algorithms are utilized to serve the specific necessities of various real-world multimedia semantics [1,[4][5][6][7][8][9].…”
Section: Introductionmentioning
confidence: 99%
“…The first LID was designed and explored with acoustic features to build a dictionary of words using speech sound. Prosody features also play a vital role in recognizing language from speech signal [1,2]. Prosodic features like pitch, energy, stress are different in tonal language compared to non-tonal languages [3,4].…”
Section: Literature Surveymentioning
confidence: 99%
“…It is the task to recognize the language of utterance without knowing the details of speaker and language content. It identifies the languages of speech utterance based on only raw signal of speech utterance [1].…”
Section: Introductionmentioning
confidence: 99%