Interspeech 2014 2014
DOI: 10.21437/interspeech.2014-419
|View full text |Cite
|
Sign up to set email alerts
|

Robust language identification using convolutional neural network features

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 27 publications
(5 citation statements)
references
References 16 publications
0
5
0
Order By: Relevance
“…These approaches required significant domain knowledge [4,16]. Nowadays most of the attempts on spoken language identification rely on neural networks for meaningful feature extraction and classification [15,7].…”
Section: Related Workmentioning
confidence: 99%
“…These approaches required significant domain knowledge [4,16]. Nowadays most of the attempts on spoken language identification rely on neural networks for meaningful feature extraction and classification [15,7].…”
Section: Related Workmentioning
confidence: 99%
“…Use of i-vectors requires significant domain knowledge (Dehak et al 2011b;Martínez et al 2011). In recent trends, researchers rely on neural networks for feature extraction and classification (Lopez-Moreno et al 2014;Ganapathy et al 2014). Researcher Revay and Teschke (2019) used the ResNet50 (He et al 2016) framework for classifying languages by generating the log-Mel spectra for each raw audio.…”
Section: Related Workmentioning
confidence: 99%
“…This work is characterised by a modified training strategy that provides equal class distribution and efficient memory utilisation. Ganapathy et al (2014) reported how they used bottleneck features from a CNN for the LID task. Bottleneck features were used in conjunction with conventional acoustic features, and performance was evaluated.…”
Section: Related Workmentioning
confidence: 99%
“…The background model can be a Gaussian mixture model (GMM) [14] or a DNN model [11], [15]. The i-vectors extracted from the training data are used to train classifiers such as support vector machines (SVMs) which perform the task of language identification [16], [17].…”
Section: Introductionmentioning
confidence: 99%