2013
DOI: 10.1016/j.dsp.2013.07.007
|View full text |Cite
|
Sign up to set email alerts
|

Constrained temporal structure for text-dependent speaker verification

Abstract: In the context of mobile devices, speaker recognition engines may suffer from ergonomic constraints and limited amount of computing resources. Even if they prove their efficiency in classical contexts, GMM/UBM systems show their limitations when restricting the quantity of speech data. In contrast, the proposed GMM/UBM extension addresses situations characterised by limited enrolment data and only the computing power typically found on modern mobile devices. A key contribution comes from the harnessing of the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…Note that the lower the equal error rate (EER) value, the higher the accuracy of the system. Ergonomic constraints and limited amount of computing resources were among the motivations of Larcher et al while presenting their study about speaker recognition engines working on mobile devices [52]. Such systems may show efficient performance in classical context; however, their limitations will appear when restricting the quantity of speech data.…”
Section: Published Work In the Year 2013mentioning
confidence: 99%
“…Note that the lower the equal error rate (EER) value, the higher the accuracy of the system. Ergonomic constraints and limited amount of computing resources were among the motivations of Larcher et al while presenting their study about speaker recognition engines working on mobile devices [52]. Such systems may show efficient performance in classical context; however, their limitations will appear when restricting the quantity of speech data.…”
Section: Published Work In the Year 2013mentioning
confidence: 99%
“…In [8], the LFCC feature extraction technique with an MYLDEA Dataset and three modeling techniques are proposed. LFCC is a feature extraction technique used in the field of speaker recognition.…”
Section: Linear Frequency Cepstral Coefficients (Lfcc)mentioning
confidence: 99%
“…In context of mobile devices, ASV engines are susceptible to suffer from limited computing resources and ergonomic constraints. A GMM‐UBM extension was prescribed in [78] to compensate the situations characterised by constrained amount of enrolment data and computation facility, typically available on hand‐held mobile devices. The key contribution was influenced from the idea of incorporation of temporal structure information of speech using pass‐phrases customised by the client and new Markov model structures in addition to it.…”
Section: Research In Asv On Short Utterancesmentioning
confidence: 99%