Masami Akamine scite author profile

Masami Akamine

3Publications

57Citation Statements Received

67Citation Statements Given

How they've been cited

106

How they cite others

Affiliations

Toshiba (Japan), Toshiba (United Kingdom)

Publications

Order By: Most citations

Voice Activity Detection: Merging Source and Filter-based Information

Drugman

Stylianou

Kida

et al. 2016

IEEE Signal Process. Lett.

View full text Add to dashboard Cite

Voice Activity Detection (VAD) refers to the problem of distinguishing speech segments from background noise. Numerous approaches have been proposed for this purpose. Some are based on features derived from the power spectral density, others exploit the periodicity of the signal. The goal of this paper is to investigate the joint use of source and filter-based features. Interestingly, a mutual information-based assessment shows superior discrimination power for the source-related features, especially the proposed ones. The features are further the input of an artificial neural network-based classifier trained on a multi-condition database. Two strategies are proposed to merge source and filter information: feature and decision fusion. Our experiments indicate an absolute reduction of 3% of the equal error rate when using decision fusion. The final proposed system is compared to four state-of-the-art methods on 150 minutes of data recorded in real environments. Thanks to the robustness of its source-related features, its multi-condition training and its efficient information fusion, the proposed system yields over the best state-of-the-art VAD a substantial increase of accuracy across all conditions (24% absolute on average).

show abstract

Complex cepstrum for statistical parametric speech synthesis

Maia

Akamine

Gales

2013

Speech Communication

View full text Add to dashboard Cite

Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?

Latorre

Gales

Buchholz

et al. 2011

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Masami Akamine

Voice Activity Detection: Merging Source and Filter-based Information

Complex cepstrum for statistical parametric speech synthesis

Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?

Contact Info

Product

Resources

About