“…In this method, three MLNs are used instead of the two MLNs of [5]. The first MLN, MLN_LF-DPF, outputs DPFs [11,12] for the input acoustic features, LFs [15]; the second MLN, MLN_cntxt, reduces misclassification at phoneme boundaries by taking a seven-frame context (from t-3 to t+3) as input; and the third MLN, MLN_Dyn, restricts the DPF dynamics by incorporating dynamic parameters (ΔDPF and ΔΔDPF) into its input. Here, MLN_LF-DPF, which is trained using the standard back-propagation learning algorithm, has two hidden layers of 256 and 96 units, respectively, and takes three input vectors (t-3, t, t+3) of 25-dimensional LFs each.…”
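The shape of the first network, MLN_LF-DPF, can be sketched as a plain feed-forward pass: three 25-dimensional LF frames (t-3, t, t+3) are concatenated into a 75-dimensional input, followed by hidden layers of 256 and 96 units. This is a minimal NumPy sketch, not the authors' implementation; the sigmoid activations, the random weights standing in for trained parameters, and the 15-dimensional DPF output size are assumptions not stated in the excerpt.

```python
import numpy as np

rng = np.random.default_rng(0)

# Dimensions from the text: three 25-dim LF vectors -> 75-dim input,
# hidden layers of 256 and 96 units. OUT_DIM (number of DPF elements)
# is an assumed value for illustration only.
IN_DIM, H1, H2, OUT_DIM = 3 * 25, 256, 96, 15

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Randomly initialised weights stand in for back-propagation-trained ones.
W1 = rng.normal(0.0, 0.1, (IN_DIM, H1)); b1 = np.zeros(H1)
W2 = rng.normal(0.0, 0.1, (H1, H2));     b2 = np.zeros(H2)
W3 = rng.normal(0.0, 0.1, (H2, OUT_DIM)); b3 = np.zeros(OUT_DIM)

def mln_lf_dpf(lf_tm3, lf_t, lf_tp3):
    """Forward pass: three 25-dim LF frames -> one DPF vector."""
    x = np.concatenate([lf_tm3, lf_t, lf_tp3])  # (75,)
    h1 = sigmoid(x @ W1 + b1)                   # first hidden layer, (256,)
    h2 = sigmoid(h1 @ W2 + b2)                  # second hidden layer, (96,)
    return sigmoid(h2 @ W3 + b3)                # DPF estimate, (15,)

dpf = mln_lf_dpf(*(rng.normal(size=25) for _ in range(3)))
```

In a full pipeline, this output would then be passed through the context (MLN_cntxt) and dynamics (MLN_Dyn) networks described above.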