2018 IEEE Spoken Language Technology Workshop (SLT) 2018
DOI: 10.1109/slt.2018.8639689
|View full text |Cite
|
Sign up to set email alerts
|

Exploring End-To-End Attention-Based Neural Networks For Native Language Identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(3 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…Many studies on language identification have been conducted, with various feature extraction and classification techniques being used. Several techniques are used to extract features from the audio data, including phone recognition followed by language modeling (PRLM) [5] and parallel phone recognition followed by language modeling (PPRLM) [5] for phonetic approach or perceptual linear prediction (PLP) [5], mel-frequency cepstral coefficient (MFCC) [6]- [8], i-vector [8], [9] and x-vector [10] for the acoustic approx neural networks [11], convolutional neural networks (CNN) [12], [13], logistic regression (LR) [8], PLDA [14], Gaussian mixture model (GMM) [15], [16], support vector machine [17], [18] are among techniques used to classify the language spoken.…”
Section: Introductionmentioning
confidence: 99%
“…Many studies on language identification have been conducted, with various feature extraction and classification techniques being used. Several techniques are used to extract features from the audio data, including phone recognition followed by language modeling (PRLM) [5] and parallel phone recognition followed by language modeling (PPRLM) [5] for phonetic approach or perceptual linear prediction (PLP) [5], mel-frequency cepstral coefficient (MFCC) [6]- [8], i-vector [8], [9] and x-vector [10] for the acoustic approx neural networks [11], convolutional neural networks (CNN) [12], [13], logistic regression (LR) [8], PLDA [14], Gaussian mixture model (GMM) [15], [16], support vector machine [17], [18] are among techniques used to classify the language spoken.…”
Section: Introductionmentioning
confidence: 99%
“…There are several steps to identify language, starting from cleaning the data from noise to help the system get better accuracy, extracting the feature from speech data, and classifying the language. There are several techniques to classify the language spoken, including neural networks [2], convolutional neural networks [3]- [6], logistic regression [7], PLDA [8], gaussian mixture model [9], [10], support vector machine [11], [12], and several techniques to extract the features from the recording, such as MFCC [13], [14], ivector [15]- [17], and x-vector [18].…”
Section: Introductionmentioning
confidence: 99%
“…With the invent of convolution neural networks (CNN) in dialect classification [19], [20] which can handle variable length utterances along with classification, three stages reduced to two. In [19], CNNs are evaluated over the Arabic database (MGB-3) with various acoustic features such asmel-frequency cepstral coefficients (MFCC), log mel-scale filterbank energies (FBANK), and spectrogram.…”
Section: Introductionmentioning
confidence: 99%