The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2nd International Conference on Data, Engineering and Applications (IDEA) 2020
DOI: 10.1109/idea49133.2020.9170720
|View full text |Cite
|
Sign up to set email alerts
|

Ensemble Learners for Identification of Spoken Languages using Mel Frequency Cepstral Coefficients

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 20 publications
0
8
0
Order By: Relevance
“…Arabic language identification for NLP is an important research effort, but few studies have been done in this area for both audio speech [20], [21], [22], [23], [24], and textual forms [25], [26]. Mel Frequency Cepstral Coefficient (MFCC) is commonly used to solve this problem [10], [11], [27], [28], and [29]. Heracleous et al [1] presented experiments for SLID by using Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN) methods.…”
Section: Literature Reviewmentioning
confidence: 99%
See 2 more Smart Citations
“…Arabic language identification for NLP is an important research effort, but few studies have been done in this area for both audio speech [20], [21], [22], [23], [24], and textual forms [25], [26]. Mel Frequency Cepstral Coefficient (MFCC) is commonly used to solve this problem [10], [11], [27], [28], and [29]. Heracleous et al [1] presented experiments for SLID by using Convolutional Neural Networks (CNN) and Deep Neural Networks (DNN) methods.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Sisodia et al [29] assessed some of the ensemble learning methods for SLID using MFCC and Delta Mel Frequency Cepstral Coefficient (DFCC) features. They recorded speech audio files using five languages: French, Dutch, English, German, and Portuguese.…”
Section: Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…As a backup in the event that the proposed method's acoustic systems fail to properly identify the language of incoming speech with high confidence, a phonetic system is utilised, with the scores from both systems being pooled. Dilip Singh Sisodia et al, [6] Using senones as targets, they illustrated and investigated the key downsides of expanding the output layer size, with a particular focus on the size of the input temporal context, in addition to the benefits. As reported in that paper, a baseline monolingual feature trained on a language that was similar to the test language performed better than multilingually trained features.…”
Section: Related Workmentioning
confidence: 99%
“…Step 5: Build Feature Map The datasets in [1,4,6] are derived from the Indic speech corpus of the International Institute of Information Technology, Hyderabad (IIIT-H), which includes 1000 spoken sentences in each of seven different languages. Thus, they have utilized a total of 7000 audio samples in our language detection model.…”
Section: Related Workmentioning
confidence: 99%