Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing
DOI: 10.1109/icassp.1994.389330
|View full text |Cite
|
Sign up to set email alerts
|

Segmentation of speech using speaker identification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
60
0

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 68 publications
(67 citation statements)
references
References 7 publications
0
60
0
Order By: Relevance
“…(6) Relation to other speech research: Speaker characterization techniques were related to research on improving speech recognition accuracy by speaker adaptation [16], improving synthesized speech quality by adding the natural characteristics of voice individuality, and converting synthesized voice individuality from one speaker to another. Studies on speaker diarization, that is, automatically extracting the speech periods of each person separately ("who spoke when") from a dialogue/conversation/meeting involving more than two people appeared as an extension of speaker recognition technology [21,45,49]. Speaker segmentation and clustering techniques have been used to aid in the adaptation of speech recognizers and for supplying metadata for audio indexing and searching.…”
Section: Smentioning
confidence: 99%
“…(6) Relation to other speech research: Speaker characterization techniques were related to research on improving speech recognition accuracy by speaker adaptation [16], improving synthesized speech quality by adding the natural characteristics of voice individuality, and converting synthesized voice individuality from one speaker to another. Studies on speaker diarization, that is, automatically extracting the speech periods of each person separately ("who spoke when") from a dialogue/conversation/meeting involving more than two people appeared as an extension of speaker recognition technology [21,45,49]. Speaker segmentation and clustering techniques have been used to aid in the adaptation of speech recognizers and for supplying metadata for audio indexing and searching.…”
Section: Smentioning
confidence: 99%
“…Diarization is the task of automatically identifying sections of spoken audio and correctly labeling them with their characteristics, for example, speech, non-speech, male-speech, female-speech, music, noise. Although speaker identification played a role in early segmentation approaches, e.g., [300], determination of the identity of the speaker, called speaker identification, or confirmation of a presumed speaker identity, called speaker verification, does not fall into the scope of the diarization task.…”
Section: Diarizationmentioning
confidence: 99%
“…In this paper we use a blind clustering approach described in [12] to generate homogeneous regions with no prior knowledge of the hypothesized speaker. For speaker detection, we score each homogeneous region as in the single-speaker case and then take the maximum score as the overall detection score.…”
Section: External Segmentationmentioning
confidence: 99%