Informed vs. Blind beamforming in AD-HOC Acoustic Sensor Networks for Meeting Transcription

Gburrek, Tobias; Schmalenstroeer, Joerg; Heitkaemper, Jens; Haeb‐Umbach, Reinhold

doi:10.1109/iwaenc53105.2022.9914772

Cited by 3 publications

(3 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…where 1/(1 + ϵ m ) is approximated by 1 − ϵ m under the assumption of ϵ m ≪ 1 [13]. In (2), the effect of STO is modeled by a time-invariant shift of r 0 ∆ m samples.…”

Section: A Linear Phase Drift (Lpd) Model In T-f Domainmentioning

confidence: 99%

“…WASNs can utilize common devices, including smartphones and tablets, to construct a microphone array without a dedicated multichannel analog-to-digital converter (ADC). Motivated by this flexibility and handiness, various audio technologies have been developed on the basis of WASNs such as speech enhancement [3]- [6], sound source separation [7]- [10], meeting transcription [11]- [13], and acoustic scene analysis [14], [15].…”

Section: Introductionmentioning

confidence: 99%

“…Blind SRO estimation, which uses only observed signals to estimate SROs, is desirable in various scenarios [11]- [13]. It is a challenging task owing to the lack of time stamps, calibration signals, and prior information on microphone and source positions.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Efficient Joint Optimization of Sampling Rate Offsets Using Entire Multichannel Signal

Masuyama,

Yamaoka,

Kawamura

et al. 2024

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

In this paper, we propose a joint estimation method for the sampling rate offsets (SROs) of multiple recording devices. In wireless acoustic sensor networks, distributed microphones are connected to different analog-to-digital converters, and thus SROs occur on non-reference channels, which degrades the performance of various array signal processing techniques. To address this problem, we propose to jointly estimate and compensate SROs of all the non-reference channels. Since the proposed method is formulated as a multivariate non-convex optimization problem, we derive an efficient optimization algorithm on the basis of the majorization-minimization and majorizationequalization algorithms. We further propose to update SROs with only low-frequency components in the initial iterations to avoid undesired local optima. Our experimental results validate the effectiveness of the joint estimation of all SROs and demonstrate its advantage in subsequent array signal processing.

show abstract

“…where 1/(1 + ϵ m ) is approximated by 1 − ϵ m under the assumption of ϵ m ≪ 1 [13]. In (2), the effect of STO is modeled by a time-invariant shift of r 0 ∆ m samples.…”

Section: A Linear Phase Drift (Lpd) Model In T-f Domainmentioning

confidence: 99%