2021
DOI: 10.1088/1742-6596/1998/1/012024
|View full text |Cite
|
Sign up to set email alerts
|

Parasitic sorority of speech processing algorithms with an assortment of statistical toolkits

Abstract: Speech is a one-dimensional quasi non-stationary time varying signal produced by a sequence of sounds. Speech signals are random in nature. Speech signals are easily corrupted by noise so recognition is an important role in speech processing. Many researches have designed recognition system with challenging parameters. Speech corpus can vary from environment, region, dialects, age, rate at which words are spoken. Pre-processing is the first step which includes framing, de-noisingand filtering. This paper focus… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 15 publications
0
0
0
Order By: Relevance
“…Although various tools including HTK, Sphinx, and Kaldi have been selected and designed for building HMM speech-based audio data processing for particular ASR isolated word-level recognition [33]. HTK, the most popular toolkit for building Hidden Markov models, was created especially for the implementation of speech-based isolated word recognition [31]. Terefore, HTK toolkits were selected for the investigation of Afan Oromo isolated speech-based recognition computer commands.…”
Section: Te Htk Software Toolkitmentioning
confidence: 99%
See 1 more Smart Citation
“…Although various tools including HTK, Sphinx, and Kaldi have been selected and designed for building HMM speech-based audio data processing for particular ASR isolated word-level recognition [33]. HTK, the most popular toolkit for building Hidden Markov models, was created especially for the implementation of speech-based isolated word recognition [31]. Terefore, HTK toolkits were selected for the investigation of Afan Oromo isolated speech-based recognition computer commands.…”
Section: Te Htk Software Toolkitmentioning
confidence: 99%
“…A signifcant change in overall accuracy in speech models is observed because of advancements in open source toolkits HTK, CMU-Sphinx, and Kaldi and their fastprocessing speed-based ASR speech recognition. Te performance of a speech system is difcult because it is dependent on variations in speakers, their pronunciations, the rate at which they speak, and the dialects of the regions they belong to [31]. ASR speech-based computer command recognizer accuracy varies with ambiguity and vocabulary size; hence, hybrid HMM works best for large vocabulary and HMM works best for small vocabulary [32].…”
mentioning
confidence: 99%