2022
DOI: 10.3390/s22103683
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language

Abstract: Communication has been an important aspect of human life, civilization, and globalization for thousands of years. Biometric analysis, education, security, healthcare, and smart cities are only a few examples of speech recognition applications. Most studies have mainly concentrated on English, Spanish, Japanese, or Chinese, disregarding other low-resource languages, such as Uzbek, leaving their analysis open. In this paper, we propose an End-To-End Deep Neural Network-Hidden Markov Model speech recognition mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9
1

Relationship

2
8

Authors

Journals

citations
Cited by 39 publications
(19 citation statements)
references
References 61 publications
(63 reference statements)
0
19
0
Order By: Relevance
“…In contrast, the one-step method carefully samples facial positions and scales to derive true and false samples without training principles. The sampling [ 38 ] and reweighting [ 13 ] techniques are widely used to reduce this imbalance. Compared to the two-step method, the one-step method is very productive and has a very high recall, but is at the risk of higher false-positive rates and less accurate localization.…”
Section: Related Workmentioning
confidence: 99%
“…In contrast, the one-step method carefully samples facial positions and scales to derive true and false samples without training principles. The sampling [ 38 ] and reweighting [ 13 ] techniques are widely used to reduce this imbalance. Compared to the two-step method, the one-step method is very productive and has a very high recall, but is at the risk of higher false-positive rates and less accurate localization.…”
Section: Related Workmentioning
confidence: 99%
“…Several vision-based methods have been proposed for wildfire smoke detection, the most prominent of which are image color and deep CNN. In addition to recent successes of DL in natural language processing [ 44 ] and image classification [ 45 ], significant progress has also been made in DL-based wildfire detection methods.…”
Section: Related Workmentioning
confidence: 99%
“…Mukhamadiyev et al proposed an end-to-end deep neural network-hidden Markov model speech recognition model and a hybrid connectionist temporal classification (CTC)-attention network for the Uzbek language and its dialects. The proposed approach reduces training time and improves speech recognition accuracy by effectively using the CTC objective function in attention model training [ 23 ]. Feature extraction and classification using a one-dimensional convolutional neural network (CNN) model, which divides heart sound signals into normal and abnormal directly independent of an electrocardiogram (ECG), was proposed in [ 24 ].…”
Section: Related Workmentioning
confidence: 99%