2022 International Workshop on Acoustic Signal Enhancement (IWAENC) 2022
DOI: 10.1109/iwaenc53105.2022.9914772
|View full text |Cite
|
Sign up to set email alerts
|

Informed vs. Blind beamforming in AD-HOC Acoustic Sensor Networks for Meeting Transcription

Abstract: We propose a diarization system, that estimates "who spoke when" based on spatial information, to be used as a front-end of a meeting transcription system running on the signals gathered from an acoustic sensor network (ASN). Although the spatial distribution of the microphones is advantageous, exploiting the spatial diversity for diarization and signal enhancement is challenging, because the microphones' positions are typically unknown, and the recorded signals are initially unsynchronized in general. Here, w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 32 publications
0
3
0
Order By: Relevance
“…where 1/(1 + ϵ m ) is approximated by 1 − ϵ m under the assumption of ϵ m ≪ 1 [13]. In (2), the effect of STO is modeled by a time-invariant shift of r 0 ∆ m samples.…”
Section: A Linear Phase Drift (Lpd) Model In T-f Domainmentioning
confidence: 99%
See 2 more Smart Citations
“…where 1/(1 + ϵ m ) is approximated by 1 − ϵ m under the assumption of ϵ m ≪ 1 [13]. In (2), the effect of STO is modeled by a time-invariant shift of r 0 ∆ m samples.…”
Section: A Linear Phase Drift (Lpd) Model In T-f Domainmentioning
confidence: 99%
“…WASNs can utilize common devices, including smartphones and tablets, to construct a microphone array without a dedicated multichannel analog-to-digital converter (ADC). Motivated by this flexibility and handiness, various audio technologies have been developed on the basis of WASNs such as speech enhancement [3]- [6], sound source separation [7]- [10], meeting transcription [11]- [13], and acoustic scene analysis [14], [15].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation