2022 9th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE) 2022
DOI: 10.1109/icitacee55701.2022.9924132
|View full text |Cite
|
Sign up to set email alerts
|

A Visual Inspection Tool for Evaluation of ASR Model Using PyKaldi and PyCHAIN

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…The ASR model is trained using Kaldi and PyChain, and its testing and evaluation are carried out iteratively by measuring the WER value. During the evaluation process, the alignment of transcripts and speech phonemes recognized by ASR is visually inspected using an evaluation tool [19]. The evaluation results provide feedback to improve  ISSN: 2252-8938 the data preparation process, optimize the language model specification and lexicon, and refine data augmentation techniques.…”
Section: Asr Development For Spwpm Servicementioning
confidence: 99%
See 1 more Smart Citation
“…The ASR model is trained using Kaldi and PyChain, and its testing and evaluation are carried out iteratively by measuring the WER value. During the evaluation process, the alignment of transcripts and speech phonemes recognized by ASR is visually inspected using an evaluation tool [19]. The evaluation results provide feedback to improve  ISSN: 2252-8938 the data preparation process, optimize the language model specification and lexicon, and refine data augmentation techniques.…”
Section: Asr Development For Spwpm Servicementioning
confidence: 99%
“…To analyze and understand the reasons behind ASR inaccuracies, researchers need a tool that enables visual examination and monitoring of the data generated during the ASR recognition process, including transcripts and speech alignment. A tool developed by Jarin et al [19] serves this purpose, providing researchers with assistance. This tool utilizes the PyKaldi [33] and PyChain modules to display speech waveforms, and MFCC features on an HTML5 canvas.…”
Section: Evaluating Asr With a Kaldi-based Asr Evaluation Toolmentioning
confidence: 99%