2013
DOI: 10.1121/1.4809770
|View full text |Cite
|
Sign up to set email alerts
|

Enhanced voice activity detection in kernel subspace domain

Abstract: This paper proposes a voice activity detection (VAD) method in a kernel subspace domain to improve the performance of the kernelbased VAD. A linear transform matrix that can simultaneously diagonalize the two covariance matrices using kernel principal component analysis is presented to generate the kernel subspace. The likelihood ratio test based on Gaussian distributions is applied for the VAD in the kernel subspace. Experimental results show that the proposed VAD algorithm outperforms the conventional approa… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2015
2015
2016
2016

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 10 publications
0
1
0
Order By: Relevance
“…Recent work in automatic VOT detection has achieved accuracy levels that are similar to human precision levels by combining multidimensional feature extraction from speech signals and machine learning (Lin & Wang, 2011 ; Sonderegger & Keshet, 2012 ). Similarly, feature extraction and machine learning techniques have been applied to improve the performance of VAD in several fields (Kim, Chin, & Chang, 2013 ; Park et al 2014 ). These studies therefore support the present approach toward the automatic extraction of onset latencies from human speech in the context of behavioral experiments.…”
Section: Discussionmentioning
confidence: 99%
“…Recent work in automatic VOT detection has achieved accuracy levels that are similar to human precision levels by combining multidimensional feature extraction from speech signals and machine learning (Lin & Wang, 2011 ; Sonderegger & Keshet, 2012 ). Similarly, feature extraction and machine learning techniques have been applied to improve the performance of VAD in several fields (Kim, Chin, & Chang, 2013 ; Park et al 2014 ). These studies therefore support the present approach toward the automatic extraction of onset latencies from human speech in the context of behavioral experiments.…”
Section: Discussionmentioning
confidence: 99%