Alex Stupakov scite author profile

Alex Stupakov

4 Publications

50 Citation Statements Received

20 Citation Statements Given

Affiliations

University of Washington, Seattle University

Publications

COSINE - A corpus of multi-party COnversational Speech In Noisy Environments

Stupakov, Hanusa, Bilmes et al. 2009

We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus. The corpus is a set of multi-party conversations recorded in real-world environments with background noise that can be used to train noise-robust speech recognition systems. We explain the motivation for creating such a corpus and describe the resulting audio recordings and transcriptions that comprise the corpus. These recordings include a 4-channel array and close-talking, far-field, and throat microphones on separate synchronized channels, allowing for unique algorithm research.

The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments

Stupakov, Hanusa, Vijaywargi et al. 2012, Computer Speech & Language

Improving multi-lattice alignment based spoken keyword spotting

Lin, Stupakov, Bilmes 2009

Spoken keyword spotting via multi-lattice alignment

Lin, Stupakov, Bilmes 2008

We propose a method for finding keywords in an audio database using a spoken query. Our method is based on performing a joint alignment between a phone lattice generated from a spoken utterance query and a second phone lattice representing a long utterance needing to be searched. We implement this joint alignment procedure in a graphical models framework. We evaluate our system on TIMIT as well as on the Switchboard conversational telephone speech (CTS) corpus. Our results show that a phone lattice representation of the spoken query achieves higher performance than using only the 1-best phone sequence representation.
