Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing
DOI: 10.1109/icassp.1994.389337
|View full text |Cite
|
Sign up to set email alerts
|

A quantitative assessment of the relative speaker discriminating properties of phonemes

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
27
0
1

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 48 publications
(30 citation statements)
references
References 4 publications
2
27
0
1
Order By: Relevance
“…However, we need to fix the length of the test segment to be equal for all classes for fair comparison. Previous studies have fixed the length of the test segment [1], [3] or showed the results as an average likelihood versus the number of frames in each phoneme [2] for a similar reason. Thus, we also perform speaker identification test with a fixed test length.…”
Section: Controlled Condition: Same Test Lengthmentioning
confidence: 99%
See 2 more Smart Citations
“…However, we need to fix the length of the test segment to be equal for all classes for fair comparison. Previous studies have fixed the length of the test segment [1], [3] or showed the results as an average likelihood versus the number of frames in each phoneme [2] for a similar reason. Thus, we also perform speaker identification test with a fixed test length.…”
Section: Controlled Condition: Same Test Lengthmentioning
confidence: 99%
“…We can estimate that if the amount of transitions is similar to the amount of vowels, transitions may achieve a better result than vowels. Nasals, which are well-known as a good feature for speaker recognition [1] are the third but its error rate is much worse than vowels and transitions. This is because the total length of the nasal tokens in an ordinary sentence is much less than the length of vowels.…”
Section: Practical Condition: One Test Sentencementioning
confidence: 99%
See 1 more Smart Citation
“…Various approaches have been proposed to overcome this limitation by utilizing phoneme information. [4][5][6][7] In this paper, we also propose a method that utilizes phoneme information to improve speaker recognition performance. While previous studies focus on using separate models for each phoneme and combining scores, 4,6,7 we focus on finding an optimal phoneme class ratio, the portion of each phoneme class in an utterance, that maximizes speaker recognition performance based on mutual information.…”
Section: Introductionmentioning
confidence: 99%
“…One of a number of studies [5] indicated that not all phonemes are equally discriminative, but in addition, this also implies that not all areas of the acoustic feature space are equally discriminative. The use of a secondary classifier presents a simple solution for evaluating the weight of evidence from the speaker's score and the consideration of the discriminative power of each partitioned region of the feature space.…”
Section: Secondary Classificationmentioning
confidence: 99%