2013
DOI: 10.1007/s10772-013-9209-1
|View full text |Cite
|
Sign up to set email alerts
|

GMM based language identification system using robust features

Abstract: In this work, we have proposed new feature vectors for spoken language identification (LID) system. The Mel frequency cepstral coefficients (MFCC) and formant frequencies derived using short-time window speech signal. Formant frequencies are extracted from linear prediction (LP) analysis of speech signal. Using these two kind of features of speech signal, new feature vectors are derived using cluster based computation. A GMM based classifier has been designed using these new feature vectors. The language speci… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 20 publications
(2 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…Specifically, we plan to pose the problem as one of learning/optimizing the representation matrix, where other evolutionary algorithms could be used. Also, we are interested on learning term-weighting schemes for other domains, like audio [34], time series [46] or accelerometer data [19], and other scenarios as one-shot recognition [21] and early classification [14].…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Specifically, we plan to pose the problem as one of learning/optimizing the representation matrix, where other evolutionary algorithms could be used. Also, we are interested on learning term-weighting schemes for other domains, like audio [34], time series [46] or accelerometer data [19], and other scenarios as one-shot recognition [21] and early classification [14].…”
Section: Discussionmentioning
confidence: 99%
“…The success of the bag of words representation in the natural language processing domain has inspired researchers in computer vision as well, and currently the BoVW is among the most used representations for images and videos [20,43,42,30,35,3,9,50,5,28]. In fact, this formulation has trespassed the image and text boundaries, and it has been used for representing audio [34], time series [46], accelerometer [19] signals, etc. In the computer vision analogy, under the BoVW, an image is represented by a vector indicating the importance of visual words for describing the content of the image.…”
Section: Bovw Representationmentioning
confidence: 99%