2015
DOI: 10.3389/fbioe.2015.00126
|View full text |Cite
|
Sign up to set email alerts
|

Improving Speaker Recognition by Biometric Voice Deconstruction

Abstract: Person identification, especially in critical environments, has always been a subject of great interest. However, it has gained a new dimension in a world threatened by a new kind of terrorism that uses social networks (e.g., YouTube) to broadcast its message. In this new scenario, classical identification methods (such as fingerprints or face recognition) have been forcedly replaced by alternative biometric characteristics such as voice, as sometimes this is the only feature available. The present study benef… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

2
12
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(14 citation statements)
references
References 44 publications
2
12
0
Order By: Relevance
“…Researchers have attempted to find alternative features. For example, it has been found that voice sourcerelated features improve speaker recognition systems by providing information that complements conventional cepstral features [4,5,6]. Das et al also reported that features extracted from the voice source signal outperform MFCCs in ASV with test utterances shorter than 3 seconds [7].…”
Section: Introductionmentioning
confidence: 99%
“…Researchers have attempted to find alternative features. For example, it has been found that voice sourcerelated features improve speaker recognition systems by providing information that complements conventional cepstral features [4,5,6]. Das et al also reported that features extracted from the voice source signal outperform MFCCs in ASV with test utterances shorter than 3 seconds [7].…”
Section: Introductionmentioning
confidence: 99%
“…For example, Espy-Wilson et al used eight acoustic parameters consisting of both voice source and vocal tract features [11]. Mazaira-Fernandez et al separated voice source from vocal tract information, and they combined cepstral coefficients from those two estimates [12]. These studies showed the effectiveness of voice source information with promising results, but such information still has not been utilized extensively in ASpR.…”
Section: Introductionmentioning
confidence: 99%
“…The most common features used in audio recognition are the Mel-frequency Cepstral Coefficients (MFCCs). In the speech processing domain, the first thirteen MFCC values have been verified to be particularly pertinent due to their approximate separation of the glottal excitation from the vocal tract [6]. The MFCCs are perceptual features computed from the short-term Fourier transform.…”
Section: A Audio Features Representationsmentioning
confidence: 99%