Anais Do XXXVII Simpósio Brasileiro De Banco De Dados (SBBD 2022) 2022
DOI: 10.5753/sbbd.2022.224329
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale Translation to Enable Response Selection in Low Resource Languages: A COVID-19 Chatbot Experiment

Abstract: Natural Language Processing for Low Resource Languages is challenging. The lack of large-scale datasets affects the performance of data-hungry algorithms. To overcome this, we employ data augmentation to enlarge the training data for the task of response selection in multi-turn retrieval-based chatbots. We automatically translated a large-scale English dataset to Brazilian Portuguese (PT_BR) and used it to train a deep neural network. For a COVID-19 chatbot system, our results show that the combination of trai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 22 publications
0
1
0
Order By: Relevance
“…The work [52] applies data augmentation to increase training data to response selection in chatbots based on multi-turn recovering. They apply the automatic translation of a massive dataset for multi-turn chatbots from English to Brazilian Portuguese, train a deep neural network with the translated dataset, and tune the neural network using a COVID-19-related dialogues dataset.…”
Section: Related Workmentioning
confidence: 99%
“…The work [52] applies data augmentation to increase training data to response selection in chatbots based on multi-turn recovering. They apply the automatic translation of a massive dataset for multi-turn chatbots from English to Brazilian Portuguese, train a deep neural network with the translated dataset, and tune the neural network using a COVID-19-related dialogues dataset.…”
Section: Related Workmentioning
confidence: 99%