2012
DOI: 10.1109/tasl.2011.2134090
|View full text |Cite
|
Sign up to set email alerts
|

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition

Abstract: Abstract-We propose a novel context-dependent (CD) model for large vocabulary speech recognition (LVSR) that leverages recent advances in using deep belief networks for phone recognition. We describe a pre-trained deep neural network hidden Markov model (DNN-HMM) hybrid architecture that trains the DNN to produce a distribution over senones (tied triphone states) as its output. The deep belief network pre-training algorithm is a robust and often helpful way to initialize deep neural networks generatively that … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

7
1,175
0
12

Year Published

2013
2013
2022
2022

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 2,670 publications
(1,244 citation statements)
references
References 61 publications
(67 reference statements)
7
1,175
0
12
Order By: Relevance
“…State-ofthe-art performance has been reported in several domains, ranging from speech recognition [1], visual object recognition [2] to text processing [3]. In fact, it could be argued that the network's learning ability has been a crucial factor in the recent success of pattern recognition applications.…”
Section: Introductionmentioning
confidence: 99%
“…State-ofthe-art performance has been reported in several domains, ranging from speech recognition [1], visual object recognition [2] to text processing [3]. In fact, it could be argued that the network's learning ability has been a crucial factor in the recent success of pattern recognition applications.…”
Section: Introductionmentioning
confidence: 99%
“…Large-scale deep learning, for instance, Deep Neutral Network (DNN), has driven advances with higher accuracy than traditional techniques in many different fields especially at image classification [1,2], speech recognition [3] and text processing [4]. To train DNNs on a large-scale, researchers have exploited distributed deep learning systems [1,5,6,7], such as DistBelief [1] and Tensorflow [7].…”
Section: Introductionmentioning
confidence: 99%
“…Recently deep learning-based methods have demonstrated excellent performance on different artificial-intelligence tasks, including speech recognition (Hinton et al, 2012a), natural language processing (Dahl et al, 2012), and computer vision (Krizhevsky et al, 2012). In the later area, Convolutional Neural Networks (CNNs) play a major role for processing visual-related problems such as image classification (Lee et al, 2009;Sermanet et al, 2013), object detection , and face recognition (Taigman et al, 2014).…”
Section: Introductionmentioning
confidence: 99%