2020 | DOI: 10.3390/electronics9101562
Decoding Strategies for Improving Low-Resource Machine Translation

Abstract: Pre-processing and post-processing are significant aspects of natural language processing (NLP) application software. Pre-processing in neural machine translation (NMT) includes subword tokenization to alleviate the problem of unknown words, parallel corpus filtering, which keeps only data suitable for training, and data augmentation to ensure that the corpus contains sufficient content. Post-processing includes automatic post-editing and the application of various strategies during decoding in the translation…
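The abstract is truncated at the source, so the specific decoding strategies are not listed here. As a hedged illustration of the kind of decoding-time strategy the title refers to, the sketch below implements plain beam search with GNMT-style length normalization in Python; `step_log_probs` is a hypothetical stand-in for a trained NMT model's next-token distribution, and none of this reproduces the paper's actual method.

```python
def beam_search(step_log_probs, bos_id, eos_id, beam_size=4,
                max_len=50, length_penalty=0.6):
    # Each hypothesis is (token_ids, cumulative log-probability).
    beams = [([bos_id], 0.0)]
    finished = []

    for _ in range(max_len):
        candidates = []
        for tokens, score in beams:
            # Expand each live hypothesis by every next-token option.
            for token_id, logp in step_log_probs(tokens):
                candidates.append((tokens + [token_id], score + logp))
        # Keep the best `beam_size` expansions; set completed ones aside.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:beam_size]:
            (finished if tokens[-1] == eos_id else beams).append((tokens, score))
        if not beams:
            break

    # Length-normalized score (GNMT-style penalty) so shorter
    # hypotheses are not unfairly favored.
    def normalized(tokens, score):
        return score / (((5 + len(tokens)) / 6) ** length_penalty)

    pool = finished or beams
    return max(pool, key=lambda c: normalized(*c))[0]
```

Raising `beam_size` trades decoding speed for search quality, while lowering `length_penalty` shifts the preference toward shorter outputs.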

Cited by 22 publications (18 citation statements, 2020–2024) | References 17 publications
“…In particular, the number of training parameters for the APE model, in which d_bal is set to 256, is lower than 5% of all the model parameters. This demonstrates that efficient training can be obtained via the BAL structure, and we can obtain an advantage with respect to the GPU usage or training time, while considering industrial services [11,30].…”
Section: E. Question 2: Does the Adapter Structure Provide an Advantage in APE? (mentioning, confidence: 99%)
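For context, the bottleneck adapter layer (BAL) referred to above is a small trainable module inserted into a frozen base model. The sketch below is a generic PyTorch version, not the cited paper's exact structure; `d_model` and the per-layer parameter count are chosen only for illustration.

```python
import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    """Generic bottleneck adapter (down-project, nonlinearity,
    up-project, residual). A hypothetical sketch of the kind of BAL
    structure the quoted statement refers to; d_bal is the bottleneck
    width (e.g. 256), d_model the Transformer hidden size."""

    def __init__(self, d_model: int, d_bal: int = 256):
        super().__init__()
        self.down = nn.Linear(d_model, d_bal)
        self.up = nn.Linear(d_bal, d_model)
        self.act = nn.ReLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(self.act(self.down(x)))

# Only the adapters are trained; the frozen base model keeps its
# parameters fixed, so the trainable fraction stays small.
adapter = BottleneckAdapter(d_model=1024, d_bal=256)
trainable = sum(p.numel() for p in adapter.parameters())
print(trainable)  # 2 * d_model * d_bal + d_bal + d_model ≈ 525k per layer
```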
“…It is difficult to establish a sufficient hardware environment to provide services, except for large companies such as Google and Facebook. In other words, as training a model involves many parameters and a large amount of data, companies that do not have sufficient server or GPU environments will find it difficult to configure the service environment and improve performance using the latest model (Park et al, 2020c). Therefore, it is important to ensure that companies with insufficient environments can provide services while performing well against LRLs.…”
Section: Introduction (mentioning, confidence: 99%)
“…To solve this problem, many researches are being conducted on the way of improving the performance of NLP application software without changing the model through data pre and post-processing, typically in machine translation (Pal et al, 2016;Currey et al, 2017;Banerjee and Bhattacharyya, 2018;Koehn et al, 2018;Kudo, 2018;Park et al, 2020b). Reflecting this trend, we conducted a study on an optimized tokenization that can improve the performance of neural machine translation (NMT) without changing the model.…”
Section: Introduction (mentioning, confidence: 99%)
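The tokenization study mentioned above builds on subword segmentation such as SentencePiece (Kudo, 2018). As a hedged illustration only — file names, language pair, and hyperparameters are hypothetical, not the authors' setup — the snippet below trains a unigram SentencePiece model and segments a sentence before NMT training.

```python
import sentencepiece as spm

# Train a unigram subword model on one side of a parallel corpus.
# "train.ko-en.txt" and the hyperparameters are hypothetical examples.
spm.SentencePieceTrainer.train(
    input="train.ko-en.txt",
    model_prefix="subword",
    vocab_size=32000,
    model_type="unigram",
    character_coverage=0.9995,
)

# Segment raw text into subword pieces before feeding it to the NMT model.
sp = spm.SentencePieceProcessor(model_file="subword.model")
pieces = sp.encode("Pre-processing alleviates the unknown-word problem.",
                   out_type=str)
print(pieces)  # e.g. ['▁Pre', '-', 'processing', '▁alle', 'viates', ...]
```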