2009 First International Conference on Advances in Multimedia 2009
DOI: 10.1109/mmedia.2009.35
|View full text |Cite
|
Sign up to set email alerts
|

Content-Based Classification and Segmentation of Mixed-Type Audio by Using MPEG-7 Features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
13
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
2
2
2

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 10 publications
0
13
0
Order By: Relevance
“…[1][2][3][4][5][6][7][8][9] Concerning the audio data, the automatic analysis of the audio signals can offer the users useful information. In the case of broadcast news, automatic processing is related to tasks such as sound recognition, 10,11 speaker recognition, 12 anchor detection, 13 role detection, [14][15][16] story boundary detection, 2,17,18 summary construction from anchor talking, 9,19 channel's quality detection, 20 sound event detection, 21,22 non-linguistic humanproduced sounds detection, 5,6,[23][24][25] audio type segmentation in sport games, 4,26,27 highlight scene extraction from sports games, 3 violence scene detection, 28 music characteristics classification, 29,30 jingle detection, 1 commercial block detection, 8 voice activity detection, 31 language recognition, 32 emotion recognition 33 and speech recognition. 34 Sound recognition is the cornerstone of analysis as typically precedes the other stages.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…[1][2][3][4][5][6][7][8][9] Concerning the audio data, the automatic analysis of the audio signals can offer the users useful information. In the case of broadcast news, automatic processing is related to tasks such as sound recognition, 10,11 speaker recognition, 12 anchor detection, 13 role detection, [14][15][16] story boundary detection, 2,17,18 summary construction from anchor talking, 9,19 channel's quality detection, 20 sound event detection, 21,22 non-linguistic humanproduced sounds detection, 5,6,[23][24][25] audio type segmentation in sport games, 4,26,27 highlight scene extraction from sports games, 3 violence scene detection, 28 music characteristics classification, 29,30 jingle detection, 1 commercial block detection, 8 voice activity detection, 31 language recognition, 32 emotion recognition 33 and speech recognition. 34 Sound recognition is the cornerstone of analysis as typically precedes the other stages.…”
Section: Introductionmentioning
confidence: 99%
“…The most commonly used are the Gaussian mixture models and the hidden Markov models. 10,11,14,26,37,40 Also widely used are the support vector machines, 11,14,38,39,41 the artificial neural networks, 10 the k-nearest neighbor algorithm, 14,38 the decision trees, 10,38 the genetic algorithms, 2 the fuzzy logic 42 and boosting techniques. 41,43 Related architectures incorporate fusion frameworks among recognition models 28,44 and combination of model-based and distance based algorithms.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…SVM was used also in [8], where it was applied to transform domain indexing by using a non-standard audio codec in a music genreclassification application. As a popular classifier, SVM was also used, along with the Hidden Markov Models (HMMs), in [9] to classify audio content into five non-silent classes. In [9], a unique HMM-model is trained for each non-silent class using MPEG-7 features.…”
Section: Introductionmentioning
confidence: 99%
“…As a popular classifier, SVM was also used, along with the Hidden Markov Models (HMMs), in [9] to classify audio content into five non-silent classes. In [9], a unique HMM-model is trained for each non-silent class using MPEG-7 features. Training set encapsulated 50% of the entire dataset in achieving the reported accuracy rates, which are, however, highly dependent on the selection of the SVM parameters, which is a well-known fact in the field.…”
Section: Introductionmentioning
confidence: 99%