Yu Nasu scite author profile

Yu Nasu

4Publications

23Citation Statements Received

65Citation Statements Given

How they've been cited

How they cite others

Affiliations

Toshiba (South Korea), Tokyo Institute of Technology

Publications

Order By: Most citations

Cross-Channel Spectral Subtraction for meeting speech recognition

Nasu

Sairyo

Furui

2011

View full text Add to dashboard Cite

We propose Cross-Channel Spectral Subtraction (CCSS), a source separation method for recognizing meeting speech where one microphone is prepared for each speaker. The method quickly adapts to changes in transfer functions and uses spectral subtraction to suppress the speech of other speakers. Compared with conventional source separation methods based on independent component analysis (ICA) or that use binary masks, it requires less computational costs and the resulting speech signals have less distortion. In a recognition task of computer-simulated, partially-overlapped speech, CCSS improved the word accuracy from 66.5% to 77.7%. It also significantly improved the recognition accuracy of speech data in actual meetings.

show abstract

Emotional transplant in statistical speech synthesis based on emotion additive model

Ohtani

Nasu

Morita

2015

View full text Add to dashboard Cite

Detection of overlapped speech using lapel microphones in meeting

Yokoyama

Nasu

Iwano

et al. 2013

Speech Communication

View full text Add to dashboard Cite

We propose an overlapped speech detection method for speech recognition and speaker diarization of meetings, where each speaker wears a lapel microphone. Two novel features are utilized as inputs for a GMM-based detector. One is speech power after cross-channel spectral subtraction which reduces the power from the other speakers. The other is an amplitude spectral cosine correlation coefficient which effectively extracts the correlation of spectral components in a rather quiet condition. We evaluated our method using a meeting speech corpus of four speakers. The accuracy of our proposed method, 75.7%, was significantly better than that of the conventional method, 66.8%, which uses raw speech power and power spectral Pearson's correlation coefficient.

show abstract

Overlapped speech detection in meeting using cross-channel spectral subtraction and spectrum similarity

Yokoyama¹,

Nasu²,

Sairyo³

et al. 2012

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yu Nasu

Cross-Channel Spectral Subtraction for meeting speech recognition

Emotional transplant in statistical speech synthesis based on emotion additive model

Detection of overlapped speech using lapel microphones in meeting

Overlapped speech detection in meeting using cross-channel spectral subtraction and spectrum similarity

Contact Info

Product

Resources

About