In this paper we combine three simple refinements that have recently been proposed to improve HMM/ANN hybrid models. The first refinement is to apply a hierarchy of two networks, where the second network models the contextual relations of the state posteriors produced by the first. The second is to train the network on context-dependent units (HMM states) instead of context-independent phones or phone states. Because the latter refinement yields a large number of output neurons, combining the two methods directly would be problematic. The third refinement is therefore to shrink the output layer of the first network using the bottleneck technique before stacking the second network on top of it. Phone recognition results on the TIMIT database demonstrate that both the context-dependent and the two-stage modeling methods bring marked improvements on their own, and that combining them yields a further significant gain in accuracy. The bottleneck technique provides an additional improvement, especially when the number of context-dependent units is large.
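The pipeline described above can be sketched as follows; this is a minimal illustration only, with invented layer sizes, an invented context window of ±4 frames, and random (untrained) weights standing in for the trained networks:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_layer(x, w, b, act=np.tanh):
    return act(x @ w + b)

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

# Hypothetical sizes: 39-dim acoustic frames, 600 context-dependent
# (CD) state targets, a 40-unit bottleneck, +-4 frames of context.
D_IN, N_CD, D_BN, CTX = 39, 600, 40, 4

# --- Net 1: acoustic frames -> hidden -> narrow bottleneck layer ---
W1 = rng.normal(0, 0.1, (D_IN, 300)); b1 = np.zeros(300)
Wb = rng.normal(0, 0.1, (300, D_BN)); bb = np.zeros(D_BN)  # bottleneck

def net1_bottleneck(frames):
    h = mlp_layer(frames, W1, b1)
    return mlp_layer(h, Wb, bb)  # compact features instead of the
                                 # full CD-state posterior vector

# --- Net 2: models context across net 1's bottleneck outputs ---
W2 = rng.normal(0, 0.1, (D_BN * (2 * CTX + 1), N_CD))
b2 = np.zeros(N_CD)

def net2_posteriors(bn):
    # Stack +-CTX neighbouring bottleneck vectors for each frame,
    # then map the windowed features to CD-state posteriors.
    padded = np.pad(bn, ((CTX, CTX), (0, 0)), mode="edge")
    win = np.concatenate(
        [padded[i:i + len(bn)] for i in range(2 * CTX + 1)], axis=1)
    return softmax(win @ W2 + b2)

frames = rng.normal(size=(100, D_IN))  # 100 acoustic frames
post = net2_posteriors(net1_bottleneck(frames))
print(post.shape)  # (100, 600): one CD-state posterior per frame
```

Without the bottleneck, net 2's input would be 600 posteriors per frame times the window width; the 40-unit bottleneck keeps the second stage tractable even when the CD-state inventory grows.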