2020
DOI: 10.1109/access.2020.3041477
|View full text |Cite
|
Sign up to set email alerts
|

Projected Minimal Gated Recurrent Unit for Speech Recognition

Abstract: Recurrent neural network (RNN) has the ability to learn long-term dependencies, which makes it suitable for acoustic modeling in speech recognition. In this paper, we revise RNN model used in acoustic modeling, namely, mGRUIP with Context module (mGRUIP-Ctx), and propose an advanced model which named Projected minimal Gated Recurrent Unit (PmGRU). The paper demonstrates two major contributions: firstly, in the case that adding context information to context module in mGRUIP-Ctx will bring about large amount of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 36 publications
(43 reference statements)
0
2
0
Order By: Relevance
“…Feng et al [26] proposed a Projected minimal Gated Recurrent Unit (PmGRU) an improved version of mGRUIP-Ctx for speech recognition acoustic model on five different ASR tasks. The proposed model has shown significant reduction in Word Error Rate (WER) compared with the WER of the mGRUIP-Ctx.…”
Section: B Deep Learning Based Methods For Automatic Speech Recogniti...mentioning
confidence: 99%
“…Feng et al [26] proposed a Projected minimal Gated Recurrent Unit (PmGRU) an improved version of mGRUIP-Ctx for speech recognition acoustic model on five different ASR tasks. The proposed model has shown significant reduction in Word Error Rate (WER) compared with the WER of the mGRUIP-Ctx.…”
Section: B Deep Learning Based Methods For Automatic Speech Recogniti...mentioning
confidence: 99%
“…First, the temporal feature extraction network can extract the variation characteristics of the time series in both temporal and spatial dimensions from multiple dimensions. Second, the feature extraction network based on the gated recurrent unit is more advantageous than the LSTM network in handling large batches of temporal data, which can significantly reduce the training time of the model while ensuring the accuracy of the prediction (Feng et al, 2020). The six kinds of water quality information were passed through the GRU module to adjust the parameter states between the hidden layers.…”
Section: System Structurementioning
confidence: 99%
“…W ITH the development of deep neural network technology, no matter in target recognition [1]- [3], object detection [4], semantic segmentation [5], speech recognition [6], [7], or in text translation [8], these learning models based on deep neural networks have achieved significant progress. The success of these models depends on a large quantity of training samples.…”
Section: Introductionmentioning
confidence: 99%