2006 IEEE Odyssey - The Speaker and Language Recognition Workshop 2006
DOI: 10.1109/odyssey.2006.248134
|View full text |Cite
|
Sign up to set email alerts
|

Using Data-driven and Phonetic Units for Speaker Verification

Abstract: Esta es la versión de autor de la comunicación de congreso publicada en: This is an author produced version of a paper published in: AbstractRecognition of speaker identity based on modeling the streams produced by phonetic decoders (phonetic speaker recognition) has gained popularity during the past few years. Two of the major problems that arise when phone based systems are being developed are the possible mismatches between the development and evaluation data and the lack of transcribed databases. Data-dri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
5
0

Year Published

2007
2007
2020
2020

Publication Types

Select...
3
1
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…ALISP Language Model System ally to spectrally stable portions of the signal. We then compute the gravity center for each segment and train a gender In this system [9], the label sequences produced by the dependent vector quantizer to cluster these centers of grav-ALISP recognizer are used to train ALISP n-gram models ity. The codebook size (64 in our case) defines the number using the HTK Language Model (LM) tools (see 14th chapof ALISP symbols.…”
Section: Complementary Information One Set Of Information Reflectsmentioning
confidence: 99%
See 2 more Smart Citations
“…ALISP Language Model System ally to spectrally stable portions of the signal. We then compute the gravity center for each segment and train a gender In this system [9], the label sequences produced by the dependent vector quantizer to cluster these centers of grav-ALISP recognizer are used to train ALISP n-gram models ity. The codebook size (64 in our case) defines the number using the HTK Language Model (LM) tools (see 14th chapof ALISP symbols.…”
Section: Complementary Information One Set Of Information Reflectsmentioning
confidence: 99%
“…This way the availability of corpora is on the NIST 2006 Speaker Recognition Evaluation data much less an issue and the training corpus can be chosen to show that the data-driven features provide complementary match the working conditions as much as possible. information and the resulting fused system reduced the erThis paper is the continuation of previous attempts ror rate in comparison to the GMM baseline system.to model high-level information using data-driven approaches [9,8]. The focus here is on the fusion of different systems that exploit data-driven high-level source of in-…”
mentioning
confidence: 98%
See 1 more Smart Citation
“…ASR [36] Data-driven temporal filters are designed using PCA, LDA and minimum classification error (MCE) framework. ASR [37] Speech segments are created using a data-driven and automatic language independent speech processing (ALISP). ASV [38] This work uses F-ratio to adjust the center and edge frequencies of the filterbank and the F-ratio is computed for speaker separability.…”
Section: Introductionmentioning
confidence: 99%
“…The units examined in the past include word N-grams, syllables, phones, and Automatic Language Independent Speech Processing (ALISP) units [4] (which are designed to mimic the phones) and MLP-based phonetic units [5]. Many of the units, such as the words and phones, are used only because their transcripts are readily available via Automatic Speech Recognition, and are incorporated without regard to their actual speaker discriminative abilities.…”
Section: Introductionmentioning
confidence: 99%