Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-1619
|View full text |Cite
|
Sign up to set email alerts
|

Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7

Relationship

3
4

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…In [11] unlabeled ATC speech is employed in semi-supervised learning to decrease word error rates. Boosting of contextual knowledge during and after decoding has also been explored in [21,22,23].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In [11] unlabeled ATC speech is employed in semi-supervised learning to decrease word error rates. Boosting of contextual knowledge during and after decoding has also been explored in [21,22,23].…”
Section: Related Workmentioning
confidence: 99%
“…ATCO2-Test: development and evaluation set available as open-source and presented at Interspeech 2021 [11,21]. The data consists of ATC communications from different airports located in Australia, Czech Republic, Slovakia and, Switzerland (see ATCO2 website 3 ).…”
Section: Datasets and Experimental Setupmentioning
confidence: 99%
“…Various works have already investigated context incorporation in the ASR [5,6,7], which marks the prior step in the ATC speech processing pipeline. Two other works of the ATCO2 project [8,9] show that the combination of HCLG and lattice boosting using Kaldi [10], reduces the ATC-ASR errors, especially for the call-signs. We build on top of these works by extracting the (erroneous) call-signs from the ASR transcripts and map them to the standardized ICAO format.…”
Section: Related Workmentioning
confidence: 99%
“…The description of acoustic ELD based on stateof-the-art x-vectors is given in Section 3. As the speech-to-text technology is one of our building blocks, we briefly discuss it in Section 4, we kindly ask the reader to follow [11] for more information. Description of various language detection systems based on the ASR output is presented in Section 5.…”
Section: Motivationmentioning
confidence: 99%
“…A more detailed description of the ASR systems is out of the scope of this paper. The reader is kindly asked to find the details in [11].…”
Section: Speech-to-textmentioning
confidence: 99%