2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) 2015
DOI: 10.1109/apsipa.2015.7415289
|View full text |Cite
|
Sign up to set email alerts
|

Music removal by convolutional denoising autoencoder in speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
13
0
1

Year Published

2016
2016
2020
2020

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 27 publications
(15 citation statements)
references
References 16 publications
1
13
0
1
Order By: Relevance
“…An MLP based imputation approach was presented for MCAR missingness in [10] and also outperformed other statistical models. A Convolutional Denoising Autoencoder model which did not impute missing data but denoised audio signals was presented in [15]. A Denoising Autoencoder with more units in the encoder layer than input layer was presented in [5] and achieved good imputation results against MICE.…”
Section: Related Workmentioning
confidence: 99%
“…An MLP based imputation approach was presented for MCAR missingness in [10] and also outperformed other statistical models. A Convolutional Denoising Autoencoder model which did not impute missing data but denoised audio signals was presented in [15]. A Denoising Autoencoder with more units in the encoder layer than input layer was presented in [5] and achieved good imputation results against MICE.…”
Section: Related Workmentioning
confidence: 99%
“…The decoder part reconstructs the input data from the low dimensional features. Convolutional denoising autoencoders (CDAEs) are similar to CAEs but CDAEs are trained from corrupted input signals and the encoder is used to extract noise robust features that the decoder can use to reconstruct a cleaned-up version of the input data [19,20]. The encoder part in CDAEs is composed of repetitions of a convolutional layer, an activation layer, and a pooling layer as shown in Fig.…”
Section: Fully Convolutional Denoising Autoencodersmentioning
confidence: 99%
“…CDAEs differ from conventional DAEs as their parameters (weights) are shared, which makes the CDAEs have fewer parameters than DAEs. The ability of CDAEs to extract repeating patterns in the input makes them suitable to be used to extract speech signals from background noise and music signals for speech enhancement and recognition [19,20].…”
Section: Introductionmentioning
confidence: 99%
“…Recently, CNN has also been applied to speech recognition [12,14] where again it achieved better recognition accuracy than DNN. Meanwhile, Zhao et al [15] proposed a music removal model based on CNN for speech recognition and obtained better recognition results compared with DNN. Hui et al also employed CNN to separate speech and noise by estimating the ideal ratio mask of the time-frequency units [16].…”
Section: Introductionmentioning
confidence: 99%