2021
DOI: 10.1038/s41597-021-01041-3
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal dataset of real-time 2D and static 3D MRI of healthy French speakers

Abstract: The study of articulatory gestures has a wide spectrum of applications, notably in speech production and recognition. Sets of phonemes, as well as their articulation, are language-specific; however, existing MRI databases mostly include English speakers. In our present work, we introduce a dataset acquired with MRI from 10 healthy native French speakers. A corpus consisting of synthetic sentences was used to ensure a good coverage of the French phonetic context. A real-time MRI technology with temporal resolut… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 38 publications
0
4
0
Order By: Relevance
“…These samples were composed of 500–3216 sagittal slices per run, of which 113–2808 were analysed after discarding time points that were unlikely to be of interest (i.e., retaining time points in which audible sound was produced by the participant as identified by synchronised audio data). These samples covered a range of speech and non-speech behaviours across a range of research disciplines (see Table 1 ), including spoken monosyllables in British English, connected speech in German (Carignan et al, 2020 ), French (Isaieva et al, 2021 ), American English as spoken by a native (L1) speaker (Narayanan et al, 2014 ), and American English spoken by a non-native speaker (Lim et al, 2021 ), as well as non-speech vocal behaviours including vocal size exaggeration (Belyk et al, 2022 ), laughter (Belyk & McGettigan, 2022 ), and whistling (Belyk et al, 2019 ). This sample reflects the natural variation in imaging parameters, and correspondingly in image quality, that analysts may face in practical application (see Fig.…”
Section: Methodsmentioning
confidence: 99%
“…These samples were composed of 500–3216 sagittal slices per run, of which 113–2808 were analysed after discarding time points that were unlikely to be of interest (i.e., retaining time points in which audible sound was produced by the participant as identified by synchronised audio data). These samples covered a range of speech and non-speech behaviours across a range of research disciplines (see Table 1 ), including spoken monosyllables in British English, connected speech in German (Carignan et al, 2020 ), French (Isaieva et al, 2021 ), American English as spoken by a native (L1) speaker (Narayanan et al, 2014 ), and American English spoken by a non-native speaker (Lim et al, 2021 ), as well as non-speech vocal behaviours including vocal size exaggeration (Belyk et al, 2022 ), laughter (Belyk & McGettigan, 2022 ), and whistling (Belyk et al, 2019 ). This sample reflects the natural variation in imaging parameters, and correspondingly in image quality, that analysts may face in practical application (see Fig.…”
Section: Methodsmentioning
confidence: 99%
“…To widen access to real-time speech MRI data and therefore stimulate research in the field, several speech MRI datasets that include series of 2D real-time images of a midsagittal slice of the vocal tract have been made publicly available 3 , 5 11 . Most of these datasets include image series of English 5 7 or French 8 , 9 speakers performing phonologically comprehensive speech tasks (i.e. speech tasks designed to include most phonemes in a wide range of contexts).…”
Section: Background and Summarymentioning
confidence: 99%
“…3 Optical microphones are also used during real-time imaging of speech in order to match dynamic images of the vocal tract to sound recordings. [4][5][6] For MR safety applications, specific measurements are often required such as temperature, RF field, electric field, or magnetic field. While modern MRI systems dispose a set of accessory devices, they are usually limited to a narrow bandwidth ECG, a respiratory belt and a finger pulse device.…”
Section: Introductionmentioning
confidence: 99%
“…Other accessory devices, that operate independently of the MR scanner, have been developed for obtaining complementary information about patient motion, such as optical tracking systems, 1 inertial measurement units (including accelerometers, gyroscopes, and possibly magnetometers), 2 or ultrasound devices 3 . Optical microphones are also used during real‐time imaging of speech in order to match dynamic images of the vocal tract to sound recordings 4–6 . For MR safety applications, specific measurements are often required such as temperature, RF field, electric field, or magnetic field.…”
Section: Introductionmentioning
confidence: 99%