Robust statistical processing of TDOA estimates for distant speaker diarization

Parada, Pablo Peso; Sharma, Dushyant; Waterschoot, Toon van; Naylor, Patrick A.

doi:10.23919/eusipco.2017.8081174

Cited by 3 publications

(1 citation statement)

References 26 publications


“…In this paper, we consider the diarization of audio recordings using spatial features alone. Several solutions have been proposed utilizing spatial features, which use the time-difference-of-arrival (TDOA) features [4,5,6,7]. However, the estimation of TDOA is sensitive to reverberation and noise.…”

Section: Introduction (mentioning)

confidence: 99%

Latent variable approach to diarization of audio recordings using ad-hoc randomly placed mobile devices

Chetupalli, Bhowmick, Sreenivas

2018

Preprint


Diarization of audio recordings from ad-hoc mobile devices using spatial information is considered in this paper. A two-channel synchronous recording is assumed for each mobile device, which is used to compute directional statistics separately at each device in a frame-wise manner. The recordings across the mobile devices are asynchronous, but a coarse synchronization is performed by aligning the signals using acoustic events or a real-time clock. Direction statistics computed for all the devices are then modeled jointly using a Dirichlet mixture model, and the posterior probability over the mixture components is used to derive the diarization information. Experiments on real-life recordings using mobile phones show a diarization error rate of less than 14%.

