Proceedings of the 8th Workshop on Asian Translation (WAT2021) 2021
DOI: 10.18653/v1/2021.wat-1.10
|View full text |Cite
|
Sign up to set email alerts
|

BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text

Abstract: With the growing popularity of smart speakers, such as Amazon Alexa, speech is becoming one of the most important modes of humancomputer interaction. Automatic speech recognition (ASR) is arguably the most critical component of such systems, as errors in speech recognition propagate to the downstream components and drastically degrade the user experience. A simple and effective way to improve the speech recognition accuracy is to apply automatic post-processor to the recognition result. However, training a pos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1
1

Relationship

4
4

Authors

Journals

citations
Cited by 21 publications
(18 citation statements)
references
References 31 publications
0
8
0
Order By: Relevance
“…The reason for choosing the commercialization system for comparison is that it is a certified system used by several researchers, and the latest deep learning-based grammatical correction methodology is applied; hence, it is the most objective and reliable system for accurate analysis. The performance of each corrector is measured by using the error sentences of K-NCT as input for three commercialization systems and performing quantitative analysis using the BLEU score [20] and GLEU score [21], which are used in various deep learning-based grammatical correction studies as evaluation indicators [1,22,23]. The experimental results are shown in Table 4.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…The reason for choosing the commercialization system for comparison is that it is a certified system used by several researchers, and the latest deep learning-based grammatical correction methodology is applied; hence, it is the most objective and reliable system for accurate analysis. The performance of each corrector is measured by using the error sentences of K-NCT as input for three commercialization systems and performing quantitative analysis using the BLEU score [20] and GLEU score [21], which are used in various deep learning-based grammatical correction studies as evaluation indicators [1,22,23]. The experimental results are shown in Table 4.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…It is because there are numerous negative effects in terms of data imbalance. This suggests in which direction we should build data and informs us that the performances can be improved through data cleaning such as PCF (Koehn et al, 2020a;Park et al, 2021c).…”
Section: Resultsmentioning
confidence: 99%
“…Currently, data augmentation is performed only on TOEIC Part 5 to improve performance, so it cannot be applied to other parts of the TOEIC yet. Therefore, in the future, it will be developed to cover all parts of TOEIC, and experiments will be conducted to examine whether the proposed method can improve performance even in other domain tasks [29,30].…”
Section: Discussionmentioning
confidence: 99%