The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639095
|View full text |Cite
|
Sign up to set email alerts
|

Voice activity detection using convolutive non-negative sparse coding

Abstract: This paper presents a voice activity detection (VAD) approach using convolutive non-negative sparse coding (CNSC) to improve the detection performance in low signal-to-noise (SNR) conditions. Our idea is to use noise-robust feature for speech signal detection while noise is reduced away. We first use magnitude spectrum as the non-negative and additive lowlevel representation of audio signals, and learn a speech dictionary from clean speech as well as a noise dictionary from noise samples. Then, the two diction… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2014
2014
2016
2016

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 21 publications
(25 reference statements)
0
1
0
Order By: Relevance
“…The difference is that the NSC concatenates sequential many frames to be a matrix and sets the constraint that the dictionary and the representations are non-negative. Recently, NSC with a speech dictionary and a noise dictionary can be found in speech recognition [39] and voice activity detection [49].…”
Section: Outline Of Sparse Codingmentioning
confidence: 99%
“…The difference is that the NSC concatenates sequential many frames to be a matrix and sets the constraint that the dictionary and the representations are non-negative. Recently, NSC with a speech dictionary and a noise dictionary can be found in speech recognition [39] and voice activity detection [49].…”
Section: Outline Of Sparse Codingmentioning
confidence: 99%