2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6288848
|View full text |Cite
|
Sign up to set email alerts
|

Generating exact lattices in the WFST framework

Abstract: We describe a lattice generation method that is exact, i.e. it satisfies all the natural properties we would want from a lattice of alternative transcriptions of an utterance. This method does not introduce substantial overhead above one-best decoding. Our method is most directly applicable when using WFST decoders where the WFST is "fully expanded", i.e. where the arcs correspond to HMM transitions. It outputs lattices that include HMM-state-level alignments as well as word labels. The general idea is to crea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
110
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 119 publications
(115 citation statements)
references
References 3 publications
(7 reference statements)
0
110
0
Order By: Relevance
“…An overview about acoustic models based on deep neural networks can be found in [57,55]. However, in this thesis we employ the traditional HMM acoustic models with more recent techniques that Kaldi [105] speech recognition toolkit provides.…”
Section: Acoustic Modelmentioning
confidence: 99%
See 2 more Smart Citations
“…An overview about acoustic models based on deep neural networks can be found in [57,55]. However, in this thesis we employ the traditional HMM acoustic models with more recent techniques that Kaldi [105] speech recognition toolkit provides.…”
Section: Acoustic Modelmentioning
confidence: 99%
“…The baseline ASR system is built by using the Kaldi [105] speech recognition toolkit. The language model that the baseline system uses is the baseline tri-gram back-off model for 20K open vocabulary for non-verbalized punctuation that is also available in the corpus.…”
Section: Asr Baselinementioning
confidence: 99%
See 1 more Smart Citation
“…Added noise sources are typically non-stationary (e.g., other speakers' utterances, home noises, or music). We used Kaldi toolkit [30] for the experiments.…”
Section: Task Descriptionmentioning
confidence: 99%
“…The Kaldi decoder generates word lattices [17] for the eval data using the GMM+SAT, SGMM and SGMM+BMMI models. The decoding lexicon is varied systematically, from the low resource lexicon of 5.7K words (8.9K pronunciations), through automatically augmented lexicons of three different sizes, to the full Babel reference lexicon of 23K words (35K pronunciations).…”
Section: Kaldi-based Lvcsr System Descriptionmentioning
confidence: 99%