ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
DOI: 10.1109/icassp43922.2022.9746270

Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge

Cited by 16 publications (2 citation statements)
References 41 publications
“…Another well-known framework is target-speaker voice activity detection (TS-VAD) [11], it estimates voice activity of all speakers at the same time with the help of their speaker embeddings. TS-VAD has shown promising performance in many tasks, such as CHiME-6 [2], DIRHARD-III [4], and AliMeeting [12], etc.…”
Section: Introduction (mentioning)
confidence: 99%
“…Recently, there has been a lot of exploration in the field of multi-party meetings scenarios [1,2,3,4,5]. Progress has also been advanced with several challenges [6,7,8,9,10,11] and datasets [12,13,14,15,16] specifically focusing on this field. One major problem of this scenario is the speech overlap.…”
Section: Introduction (mentioning)
confidence: 99%