2000
DOI: 10.1006/dspr.1999.0356
|View full text |Cite
|
Sign up to set email alerts
|

Speaker Verification by Human Listeners: Experiments Comparing Human and Machine Performance Using the NIST 1998 Speaker Evaluation Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
31
2

Year Published

2001
2001
2018
2018

Publication Types

Select...
6
4

Relationship

0
10

Authors

Journals

citations
Cited by 51 publications
(36 citation statements)
references
References 9 publications
1
31
2
Order By: Relevance
“…This exploratory study can help shed light on the acoustic cues used by the human auditory system for spoofing detection. There are a few interesting comparisons of human performance to that of ASV systems [88], [89], [90], but only one attempt to compare human performance to automatic spoofing detection [91]. This study compares the performance of 100 native English listeners to an automatic approach using a combination of MFCCand CNFP-based detectors.…”
Section: Table V Comparative Performance Of Several Post-evaluation mentioning
confidence: 99%
“…This exploratory study can help shed light on the acoustic cues used by the human auditory system for spoofing detection. There are a few interesting comparisons of human performance to that of ASV systems [88], [89], [90], but only one attempt to compare human performance to automatic spoofing detection [91]. This study compares the performance of 100 native English listeners to an automatic approach using a combination of MFCCand CNFP-based detectors.…”
Section: Table V Comparative Performance Of Several Post-evaluation mentioning
confidence: 99%
“…On the other hand, similar to speech recognition tasks, human listeners perform robustly in speaker recognition tasks [28]. The human ability to function well in noisy acoustic environments is due to a perceptual process termed auditory scene analysis (ASA) [2].…”
Section: Introductionmentioning
confidence: 99%
“…Traditionally, automatic speaker recognition systems have relied mostly on low-level characteristics by using short-term features related to the spectrum of the voice, associated mainly to the physical traits of the vocal apparatus. However, humans rely on several linguistic levels contained in the speech signal in order to identify people from voice alone [1]: the voice timbre, a characteristic laugh, specific and repeatedly used words, etc. In contrast to the spectral level, these linguistic features are mainly related to the learned habits and style.…”
Section: Introductionmentioning
confidence: 99%