2019 International Joint Conference on Neural Networks (IJCNN)
DOI: 10.1109/ijcnn.2019.8852420

Subword Semantic Hashing for Intent Classification on Small Datasets

Abstract: In this paper, we introduce the use of Semantic Hashing as embedding for the task of Intent Classification and achieve state-of-the-art performance on three frequently used benchmarks. Intent Classification on a small dataset is a challenging task for data-hungry state-of-the-art Deep Learning based systems. Semantic Hashing is an attempt to overcome such a challenge and learn robust text classification. Current word embedding based methods [11], [13], [14] are dependent on vocabularies. One of the major drawb…
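The abstract's central idea, representing text through subword-level features rather than a fixed word vocabulary, can be sketched roughly as follows. This is a minimal illustration under assumed details (the '#' boundary padding and the trigram size follow common descriptions of subword semantic hashing; the function names are made up), not the authors' reference implementation:

```python
from collections import Counter

def subword_tokens(word, n=3):
    """Wrap a word in '#' boundary markers and slide a character n-gram window.

    'talk' -> '#talk#' -> ['#ta', 'tal', 'alk', 'lk#']
    """
    padded = f"#{word}#"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]

def hash_text(text, n=3):
    """Represent a sentence as a bag (multiset) of subword n-grams."""
    counts = Counter()
    for word in text.lower().split():
        counts.update(subword_tokens(word, n))
    return counts
```

Because the features are built from character n-grams instead of whole words, unseen or misspelled words still share most of their subwords with known ones, which is what makes such a representation vocabulary-independent and usable on small datasets; the resulting count vectors can be fed to any standard classifier.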


Cited by 21 publications (14 citation statements). References 11 publications.
“…On AskUbuntu and WebApp we compared our solution with popular architectures 1 as well as the current state-of-the-art solution on those datasets, Subword Semantic Hashing (SSH) (Shridhar et al. 2018). We also compared our solution with the original capsule implementation (Zhang et al. 2018) that was tested on the ATIS dataset, as well as the current SOTA on this dataset (Chen and Yu 2019). The results show that our solution is able to achieve state-of-the-art results on datasets with a small number of examples, such as AskUbuntu and WebApp, as well as larger datasets like ATIS.…”
Section: Experiments and Results
confidence: 99%
“…We have tested our solution on AskUbuntu and WebApp, where we achieved results of 89% and 92%, outperforming the current state-of-the-art solution (Shridhar et al. 2018), as well as on the ATIS dataset, where we achieved 98.89%, with the current state-of-the-art (Chen and Yu 2019) at 98.61%.…”
Section: Introduction
confidence: 82%
“…Adversarial training method for the multi-task and multi-lingual joint modelling.
[Mohasseb et al 2018] Grammar feature exploration: grammar-based framework with 3 main features.
[Xie et al 2018] Short text; semantic feature expansion: Semantic Tag-empowered combined features.
[Qiu et al 2018] Potential consciousness information mining: a similarity calculation method based on LSTM and a traditional machine learning method based on multi-feature extraction.
[…] OOD utterances: multi-task learning.
[Cohan et al 2019] Utilisation of naturally labelled data: multitask learning based on joint loss.
[Shridhar et al 2019] OOV issue; small/lack of labelled training data: subword semantic hashing.
[…] Learning of deep semantic information: hybrid CNN and bidirectional GRU neural network with pretrained embeddings (Char-CNN-BGRU).
[Lin and Xu 2019] Emerging intents detection: maximise inter-class variance and minimise intra-class variance to get the discriminative feature.
[Ren and Xue 2020] Similar utterance with different intent: triples of samples used for training.
[Yilmaz and Toraman 2020] OOD utterances: KL divergence vector for classification.
[Costello et al 2018] developed a novel multi-layer ensembling approach that ensembles both different model initialisations and different model architectures to determine how multi-layer ensembling improves performance on multilingual intent classification. They constructed a CNN with character-level embedding and a bidirectional CNN with attention mechanism.…”
Section: Paper
confidence: 99%
“…The investigation reveals that IBM Watson significantly outperforms other platforms such as Dialogflow, MS LUIS and Rasa, which also demonstrate very good results. Three English benchmark datasets, i.e., askUbuntu, chatbot and webApps [8], were used in the experiments [9]. The authors introduce a sub-word semantic hashing technique to process input texts before classification.…”
Section: Related Work
confidence: 99%