2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011
DOI: 10.1109/icassp.2011.5947494
|View full text |Cite
|
Sign up to set email alerts
|

Deep Belief Networks using discriminative features for phone recognition

Abstract: Deep Belief Networks (DBNs) are multi-layer generative models. They can be trained to model windows of coefficients extracted from speech and they discover multiple layers of features that capture the higher-order statistical structure of the data. These features can be used to initialize the hidden units of a feed-forward neural network that is then trained to predict the HMM state for the central frame of the window. Initializing with features that are good at generating speech makes the neural network perfo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
268
2
3

Year Published

2012
2012
2023
2023

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 335 publications
(274 citation statements)
references
References 13 publications
1
268
2
3
Order By: Relevance
“…In recent years, deep learning models have been used for phonetic classification and recognition on a variety of speech tasks and showed promising results [7,8]. A Deep Boltzmann Machine is a network of symmetrically coupled stochastic binary units [6,9].…”
Section: Deep Boltzmann Machinesmentioning
confidence: 99%
“…In recent years, deep learning models have been used for phonetic classification and recognition on a variety of speech tasks and showed promising results [7,8]. A Deep Boltzmann Machine is a network of symmetrically coupled stochastic binary units [6,9].…”
Section: Deep Boltzmann Machinesmentioning
confidence: 99%
“…In their seminal work, Mohamed et al [1] proposed to use a system composed of many layers of logistic units. In order to overcome the notoriously difficult problem of optimizing very deep networks, they proposed to use a layer-wise unsupervised learning algorithm, called Restricted Boltzmann Machine (RBM) [2], as a way to provide a sensible initialization and they demonstrated significant improvements over the baseline GMM.…”
Section: Introductionmentioning
confidence: 99%
“…7 min- utes), using 3-layer NN mapping brings more benefit. Recently, deep neural networks have been applied successfully for speech recognition [20]- [23]. They show significant improvements over 3-layer NNs.…”
Section: Discussion On Mapping Structurementioning
confidence: 99%