2013
DOI: 10.1109/tasl.2012.2231072
|View full text |Cite
|
Sign up to set email alerts
|

Single-Channel Speech-Music Separation for Robust ASR With Mixture Models

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 30 publications
0
12
0
Order By: Relevance
“…The vector will be approximately sparse, providing the clipping level is high enough [17]. 2) Single channel denoising: In many applications people wish to separate clean speeches from various kinds of interferences like music noise, babble noise and vehicle noise [27], [31], etc. This is a typical SCSS problem and we need to learn from previous speeches and learn from noise data.…”
Section: Our Analysis Of the Scss Problemmentioning
confidence: 99%
See 1 more Smart Citation
“…The vector will be approximately sparse, providing the clipping level is high enough [17]. 2) Single channel denoising: In many applications people wish to separate clean speeches from various kinds of interferences like music noise, babble noise and vehicle noise [27], [31], etc. This is a typical SCSS problem and we need to learn from previous speeches and learn from noise data.…”
Section: Our Analysis Of the Scss Problemmentioning
confidence: 99%
“…However, the RIP condition guarantees a successful recovery of sparse vector providing it is sparse enough. The recovery task is fulfilled by solving the following sparse optimization problem (29) or its equivalent form (30) With a specified confidence level factor , problem (30) can be rewritten as (31) which can be easily solved by the OMP algorithm. Finally the impulsive samples are updated as (32) The remaining tiny Gaussian noise can be suppressed by a simple spectral subtraction approach.…”
Section: A the Fodar-domp Algorithm Based On Rtfdmentioning
confidence: 99%
“…Speech separation aims at extracting the speech signals of each speaker in a noisy mixture. It has many applications, for example in automatic speech recognition [1], hearing aids [2] or music processing [3]. In recent years, deep neural network (DNN)-based solutions have replaced model-based approaches because of the great progress they enabled [4][5][6][7][8].…”
Section: Introductionmentioning
confidence: 99%
“…Musical sound sources also often corrupt speech signals, which is relevant for separating speech in movies, radio shows, or home speaker speech recognition. The speech-music separation task has mainly been studied in simplified settings so far [2,3].…”
Section: Introductionmentioning
confidence: 99%