Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1553
|View full text |Cite
|
Sign up to set email alerts
|

DA-IICT/IIITV System for Low Resource Speech Recognition Challenge 2018

Abstract: This paper presents an Automatic Speech Recognition (ASR) system, in the Gujarati language, developed for Low Resource Speech Recognition Challenge for Indian Languages in INTER-SPEECH 2018. For front-end, Amplitude Modulation (AM) features are extracted using the standard and data-driven auditory filterbanks. Recurrent Neural Network Language Models (RNNLM) are used for this task. There is a relative improvement of 36.18 % and 40.95 % in perplexity on the test and blind test sets, respectively, compared to 3-… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…The data sets are different. [52], [53], [49], and [55] used isolated Gujarati words, [54] used 25-word sentences, [56] did not limit to number of words in the sentences, [51], [57] used continuous speech of three Indian languages, and [50] used continuous speech of 9 Indian languages. The table highlights the accuracy achieved with Gujarati language.…”
Section: Mathematical Evaluation Of Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The data sets are different. [52], [53], [49], and [55] used isolated Gujarati words, [54] used 25-word sentences, [56] did not limit to number of words in the sentences, [51], [57] used continuous speech of three Indian languages, and [50] used continuous speech of 9 Indian languages. The table highlights the accuracy achieved with Gujarati language.…”
Section: Mathematical Evaluation Of Resultsmentioning
confidence: 99%
“…There is no historical evidence of Cocktail-party scene with Gujarati language [47][48][49][50][51][52][53][54][55][56][57]. For ASR in Gujarati, methods like Statistical, Neural Networks and End-to-end recognition are used [35].…”
Section: Introductionmentioning
confidence: 99%
“…The DAIICT-IIITV Gujarati system [15] used a combination of TDNN and TDNN-LSTM Acoustic Models with various acoustic features. RNN-based Language Models for rescoring were found to outperform n-gram models.…”
Section: Resultsmentioning
confidence: 99%
“…There is no historical evidence of Cocktail-party scene with Gujarati language [47][48][49][50][51][52][53][54][55][56][57]. For ASR in Gujarati, methods like Statistical, Neural Networks and End-to-end recognition are used [35].…”
Section: Introductionmentioning
confidence: 99%