Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-1946
|View full text |Cite
|
Sign up to set email alerts
|

Improved Speech Enhancement Using a Time-Domain GAN with Mask Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
2
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 16 publications
0
2
0
1
Order By: Relevance
“…Mapping-based and masking-based methods are another categorization of speech enhancement approaches which are used in [1], [2], [3], [4] and [5], [6], [7] respectively. In the mapping-based approach, the algorithm attempts to figure out how to connect the clean target audio and the noisy input.…”
Section: Speech Enhancement Methods Using Ganmentioning
confidence: 99%
“…Mapping-based and masking-based methods are another categorization of speech enhancement approaches which are used in [1], [2], [3], [4] and [5], [6], [7] respectively. In the mapping-based approach, the algorithm attempts to figure out how to connect the clean target audio and the noisy input.…”
Section: Speech Enhancement Methods Using Ganmentioning
confidence: 99%
“…These speech and noise spectrograms are used to compute the speech mark loss. TIMIT dataset is used and the method outperforms DNN based speech enhancement and SEGAN [14].…”
Section: Time Domain Gan With Mask Learningmentioning
confidence: 99%
“…Although these methods address the false-extraction problem to some extent, they do not focus on distinguish whether the target speaker is present or absent, resulting in suboptimal performance. Some methods introduce additional information to verify the speaker's presence, such as speaker activity information [13,14] or visual cue [7], resulting in limited application of these methods.…”
Section: Introductionmentioning
confidence: 99%