Erik Bresch scite author profile

USC-TIMIT is an extensive database of multimodal speech production data, developed to complement existing resources available to the speech research community and with the intention of being continuously refined and augmented. The database currently includes real-time magnetic resonance imaging data from five male and five female speakers of American English. Electromagnetic articulography data have so far also been collected from four of these speakers. The two modalities were recorded in two independent sessions while the subjects produced the same 460-sentence corpus used previously in the MOCHA-TIMIT database. In both cases the audio signal was recorded and synchronized with the articulatory data. The database and companion software are freely available to the research community.

Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans

Bresch

Nielsen

Nayak

et al. 2006

111

This letter describes a data acquisition setup for recording and processing running speech from a person in a magnetic resonance imaging (MRI) scanner. The main focus is on ensuring synchronicity between image and audio acquisition, and on obtaining a good signal-to-noise ratio to facilitate further speech analysis and modeling. A field-programmable gate array based hardware design for synchronizing the scanner image acquisition to other external data such as audio is described. The audio setup itself features two fiber optical microphones and a noise-canceling filter. Two noise cancellation methods are described, including a novel approach using a pulse sequence specific model of the gradient noise of the MRI scanner. The setup is useful for scientific speech production studies. Sample results of speech and singing data acquired and processed using the proposed method are given.

Seeing speech: Capturing vocal tract shaping using real-time magnetic resonance imaging [Exploratory DSP]

Bresch

Kim

Nayak

et al. 2008

IEEE Signal Process. Mag.

100

Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images

Bresch

Narayanan

2009

IEEE Trans. Med. Imaging

We describe a method for unsupervised region segmentation of an image using its spatial frequency domain representation. The algorithm was designed to process large sequences of real-time magnetic resonance (MR) images containing the 2-D midsagittal view of a human vocal tract airway. The segmentation algorithm uses an anatomically informed object model, whose fit to the observed image data is hierarchically optimized using a gradient descent procedure. The goal of the algorithm is to automatically extract the time-varying vocal tract outline and the position of the articulators to facilitate the study of the shaping of the vocal tract during speech production.

Timing effects of syllable structure and stress on nasals: A real-time MRI examination

Byrd

Tobin

Bresch

et al. 2009

Journal of Phonetics

The coordination of velum and oral gestures for English [n] is studied using real-time MRI movies to reconstruct vocal tract aperture functions. This technique allows for the examination of parts of the vocal tract otherwise inaccessible to dynamic imaging or movement tracking. The present experiment considers syllable onset, coda, and juncture geminate nasals and also addresses the effects of a variety of word stress patterns on segment internal coordination. We find a bimodal timing pattern in which near-synchrony of velum lowering and tongue tip raising characterizes the timing for onsets and temporal lag between the gestures is characteristic for codas, supporting and extending the findings of Krakow (1989, 1993) for [m]. Intervocalic word-internal nasals are found to have timing patterns that are sensitive to the local stress context, which suggests the presence of an underlying timing specification that can yield flexibly.

Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC)
