2021
DOI: 10.1007/s12652-021-03439-8
|View full text |Cite
|
Sign up to set email alerts
|

Chatbot Interaction with Artificial Intelligence: human data augmentation with T5 and language transformer ensemble for text classification

Abstract: In this work we present the Chatbot Interaction with Artificial Intelligence (CI-AI) framework as an approach to the training of a transformer based chatbot-like architecture for task classification with a focus on natural human interaction with a machine as opposed to interfaces, code, or formal commands. The intelligent system augments human-sourced data via artificial paraphrasing in order to generate a large set of training data for further classical, attention, and language transformation-based learning a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 43 publications
(27 citation statements)
references
References 54 publications
2
13
0
Order By: Relevance
“…BERT and its derivatives (RoBERTa, ALBERT, BART, etc.) are state-of-the-art models that often outperform deep learning and classical models due to their simple yet advanced technique [96], [105], [106]. The BERT model includes many parameters, operates on a large scale, and has high latency [107].…”
Section: Text Classificationmentioning
confidence: 99%
“…BERT and its derivatives (RoBERTa, ALBERT, BART, etc.) are state-of-the-art models that often outperform deep learning and classical models due to their simple yet advanced technique [96], [105], [106]. The BERT model includes many parameters, operates on a large scale, and has high latency [107].…”
Section: Text Classificationmentioning
confidence: 99%
“…Researchers around the world are inspired to develop the state of-the-art chatbots by embedding different machine learning algorithms such as naïve Bayes algorithm and support vector machine (SVM) [4,5].…”
Section: Related Workmentioning
confidence: 99%
“…This model has previously been successfully leveraged for data augmentation through paraphrase generation in a real-world application pertaining to the development of a human-robot interaction framework [106]. We also explored an alternative T5 model fine-tuned on the Google PAWS dataset for this task, but it was not utilised as most paraphrases generated were completely duplicate repetitions of the original questions.…”
Section: ) Utilising the T5 Model For Paraphrase Generationmentioning
confidence: 99%