2010 IEEE International Conference on Acoustics, Speech and Signal Processing 2010
DOI: 10.1109/icassp.2010.5495086
|View full text |Cite
|
Sign up to set email alerts
|

Multistream speaker diarization beyond two acoustic feature streams

Abstract: Speaker diarization for meetings data are recently converging towards multistream systems. The most common complementary features used in combination with MFCC are Time Delay of Arrival (TDOA). Also other features have been proposed although, there are no reported improvements on top of MFCC+TDOA systems. In this work we investigate the combination of other feature sets along with MFCC+TDOA. We discuss issues and problems related to the weighting of four different streams proposing a solution based on a smooth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
14
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 9 publications
(14 citation statements)
references
References 9 publications
0
14
0
Order By: Relevance
“…Towards this end, the paper exthends the Information Bottleneck combination [6]. The method forms the space of relevance variables, necessary for the aIB clustering, using the output of two separate diarization systems.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Towards this end, the paper exthends the Information Bottleneck combination [6]. The method forms the space of relevance variables, necessary for the aIB clustering, using the output of two separate diarization systems.…”
Section: Conclusion and Discussionmentioning
confidence: 99%
“…6 12.4 aIB model-based combination outperforms the HMM/GMM system by 0.8% achieving state-of-the-art results; this system will be used as baseline and the proposed techniques will be benchmarked w.r.t. it.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations