2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011
DOI: 10.1109/icassp.2011.5947528
|View full text |Cite
|
Sign up to set email alerts
|

Multi-stream spectro-temporal and cepstral features based on data-driven hierarchical phoneme clusters

Abstract: We propose a method to enhance multi-stream Gabor and MFCC features using data-driven hierarchical phoneme clusters to yield more discriminating posteriors. We take into account different hierarchy structures, and in addition perform mean and variance normalization. A relative improvement of 11.5% over the conven tional MFCC Tandem system was achieved in experiments con ducted on Mandarin broadcast news. We analyze the complemen tarity between Gabor and MFCC features for different types of phonemes, and invest… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2013
2013
2013
2013

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 20 publications
0
0
0
Order By: Relevance