1990
DOI: 10.1016/0167-6393(90)90010-7

Speech database development at MIT: TIMIT and beyond

Cited by 508 publications
(249 citation statements)
references
References 1 publication
“…Four speech intervals were replaced with noise: Cs, Vs, CVs, and VCs. The TIMIT database provides phonetic classification to distinguish American English consonants and vowels (45-47); this classification was adopted with one modification. When an interval marked "closure" (labeled separately in the TIMIT database) preceded one labeled stop consonant or affricate, these intervals were combined and treated as a single consonant.…”
Section: Methods; citation type: mentioning
confidence: 99%
“…The single word stimuli in the database include repetitions of English digits, the international radio alphabets, the 20 most frequent words in the British National Corpus (BNC), and a set of words selected by Kent et al. to demonstrate relevant phonetic contrasts [9]. The sentence stimuli are derived from the Yorkston-Beukelman assessment of intelligibility [10] and the TIMIT database [11]. In addition, each participant is asked to describe the contents of a few photographs that are selected from standardized tests of linguistic ability in his/her own words so as to include dictation-style speech into the database.…”
Section: Description Of Data; citation type: mentioning
confidence: 99%
“…The only existing corpus of regional variation in the United States that obtained high quality audio recordings in a sound-attenuated booth is the TIMIT Acoustic-Phonetic Continuous Speech Corpus (Fisher et al., 1986; Zue et al., 1990). The TIMIT corpus contains recordings of 630 talkers who each read 10 different sentences.…”
Section: Existing Spoken Language Corpora With Dialect Variation; citation type: mentioning
confidence: 99%