2017
DOI: 10.1007/978-3-319-68612-7_40
|View full text |Cite
|
Sign up to set email alerts
|

Masked Conditional Neural Networks for Audio Classification

Abstract: We present the ConditionaL Neural Network (CLNN) and the Masked ConditionaL Neural Network (MCLNN) 1 designed for temporal signal recognition. The CLNN takes into consideration the temporal nature of the sound signal and the MCLNN extends upon the CLNN through a binary mask to preserve the spatial locality of the features and allows an automated exploration of the features combination analogous to hand-crafting the most relevant features for the recognition task. MCLNN has achieved competitive recognition accu… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
5
1
1

Relationship

4
3

Authors

Journals

citations
Cited by 14 publications
(25 citation statements)
references
References 19 publications
0
25
0
Order By: Relevance
“…Probabilistic techniques combined with deep learning have been explored too. Medhat et al (2017) propose a type of neural networks designed for temporal signal recognition, the Conditional Neural Network and the Masked Conditional Neural Network achieving accuracy levels between 85% and 86% on the GTZAN and ISMIR2004 datasets.…”
Section: Content-based Classificationmentioning
confidence: 99%
“…Probabilistic techniques combined with deep learning have been explored too. Medhat et al (2017) propose a type of neural networks designed for temporal signal recognition, the Conditional Neural Network and the Masked Conditional Neural Network achieving accuracy levels between 85% and 86% on the GTZAN and ISMIR2004 datasets.…”
Section: Content-based Classificationmentioning
confidence: 99%
“…The ConditionaL Neural Network (CLNN) [13] is a discriminative model designed for temporal signals. The CLNN extends from the visible to hidden links proposed in the CRBM.…”
Section: Conditional Neural Networkmentioning
confidence: 99%
“…The models we discuss in this work have been considered in [13] for music genre classification with more emphasis on the influence of the data split (training set, validation set, and testing set) on the reported accuracies in the literature. In this work, we evaluate the applicability of the models to sounds of a different nature, i.e.…”
Section: Introductionmentioning
confidence: 99%
“…Weight sharing makes the CNN translation invariant, which does not preserve the spatial locality of the learned features. The ConditionaL Neural Networks (CLNN) [17] and its variant the Masked ConditionaL Neural Network (MCLNN) [17] are developed from the ground up exploiting the nature of the sound signal. The CLNN considers the interframes relation in a temporal signal and the MCLNN embeds a filterbank-like behavior that enables individual bands and suppresses others through an enforced systematic sparseness.…”
Section: Introductionmentioning
confidence: 99%