Segmentation of speech using speaker identification

Wilcox, Lynn; Chen, F.; Kimber, Don; Balasubramanian, Vineeth N

doi:10.1109/icassp.1994.389330

Cited by 68 publications

(67 citation statements)

References 7 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…(6) Relation to other speech research: Speaker characterization techniques were related to research on improving speech recognition accuracy by speaker adaptation [16], improving synthesized speech quality by adding the natural characteristics of voice individuality, and converting synthesized voice individuality from one speaker to another. Studies on speaker diarization, that is, automatically extracting the speech periods of each person separately ("who spoke when") from a dialogue/conversation/meeting involving more than two people appeared as an extension of speaker recognition technology [21,45,49]. Speaker segmentation and clustering techniques have been used to aid in the adaptation of speech recognizers and for supplying metadata for audio indexing and searching.…”

Section: Smentioning

confidence: 99%

40 Years of Progress in Automatic Speaker Recognition

Furui

2009

Advances in Biometrics

View full text Add to dashboard Cite

Abstract. Research in automatic speaker recognition has now spanned four decades. This paper surveys the major themes and advances made in the past 40 years of research so as to provide a technological perspective and an appreciation of the fundamental progress that has been accomplished in this important area of speech-based human biometrics. Although many techniques have been developed, many challenges have yet to be overcome before we can achieve the ultimate goal of creating human-like machines. Such a machine needs to be able to deliver satisfactory performance under a broad range of operating conditions. A much greater understanding of the human speech process is still required before automatic speaker recognition systems can approach human performance.

show abstract

Section: Smentioning

confidence: 99%

40 Years of Progress in Automatic Speaker Recognition

Furui

2009

Advances in Biometrics

View full text Add to dashboard Cite

show abstract

“…Diarization is the task of automatically identifying sections of spoken audio and correctly labeling them with their characteristics, for example, speech, non-speech, male-speech, female-speech, music, noise. Although speaker identification played a role in early segmentation approaches, e.g., [300], determination of the identity of the speaker, called speaker identification, or confirmation of a presumed speaker identity, called speaker verification, does not fall into the scope of the diarization task.…”

Section: Diarizationmentioning

confidence: 99%

Automatic Summarization

Larson

2012

FNT in Information Retrieval

View full text Add to dashboard Cite

“…In this paper we use a blind clustering approach described in [12] to generate homogeneous regions with no prior knowledge of the hypothesized speaker. For speaker detection, we score each homogeneous region as in the single-speaker case and then take the maximum score as the overall detection score.…”

Section: External Segmentationmentioning

confidence: 99%