Interspeech 2018 2018
DOI: 10.21437/interspeech.2018-1942
|View full text |Cite
|
Sign up to set email alerts
|

Fearless Steps: Apollo-11 Corpus Advancements for Speech Technologies from Earth to the Moon

Abstract: The Apollo Program is one of the most significant benchmarks for technology and innovation in human history. The previously introduced UTD-CRSS Fearless Steps initiative resulted in the digitization of the original analog audio tapes recorded during the Apollo Space Missions. The entire speech data for the Apollo 11 Mission is now being made publicly available with the release of the Fearless Steps Corpus. This corpus consists of a cumulative 19,000 hours of conversational speech spanning over thirty time-sync… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
29
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 27 publications
(29 citation statements)
references
References 10 publications
0
29
0
Order By: Relevance
“…To ensure an equitable distribution of data into training, evaluation, and development sets for the challenge tasks, we have categorized the data based on noise levels, amount of speech content, and amount of silence. Due to the long silence durations for some channels, and based on importance of the mission, the speech activity density of the corpus varies throughout the mission [1]. A total of 80 hours of audio are provided for task system development.…”
Section: Challenge Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…To ensure an equitable distribution of data into training, evaluation, and development sets for the challenge tasks, we have categorized the data based on noise levels, amount of speech content, and amount of silence. Due to the long silence durations for some channels, and based on importance of the mission, the speech activity density of the corpus varies throughout the mission [1]. A total of 80 hours of audio are provided for task system development.…”
Section: Challenge Datasetmentioning
confidence: 99%
“…The last six years have seen the development of the Corpus consisting of over 19,000 hours of audio data from the Apollo 1, 11, 13 and the Gemini 8 missions. The unique nature of this data posed a serious challenge for analysis using conventional speech technologies [1]. This challenge motivated the development of multiple solutions from CRSS catered to the nature and complexity of the Apollo data [2,3,4,5,6,7].…”
Section: Introductionmentioning
confidence: 99%
“…The Fearless Steps Challenge 1 2019 is organized to evaluate the state-of-the-art methods for different applications with naturalistic audio signals in challenging environments [29,30]. A corpus from the data captured during Apollo-11 Mission is released for the challenge.…”
Section: Introductionmentioning
confidence: 99%
“…The combo SAD has been proposed by the challenge organizers in their previous investigations [33]. The speech recorded in the corpus are unprompted, and hence subject to significant variations in speech characteristics for every speaker [29].…”
Section: Introductionmentioning
confidence: 99%
“…Finally, the currently ongoing VOiCES [7] and Fearless Steps [8] challenges also explore interesting areas. VOiCES focuses on robustness to reverberation and background noises of replayed speech, while Fearless Steps defines a more holistic set of challenges using data from the Apollo-11 mission.…”
Section: Introductionmentioning
confidence: 99%