Breaking the Closed World Assumption in Text Classification

Fei, Geli; Liu, Bing

doi:10.18653/v1/n16-1061

Cited by 109 publications

(80 citation statements)

References 23 publications

(31 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The performance of the resulting model (i.e., ER+POG) on the OSQ and IPA dataset is depicted in Table II and Table III, respectively. 4 Note that FPR= FP FP+TN , and TPR= TP TP+FN , where FP, TP, TN, FN is the number of false positives, true positives, true negatives, false negatives, respectively. It can be seen that our model outperforms all the baselines on both datasets significantly, which demonstrates the effectiveness of the generated pseudo OOD utterances.…”

Section: E Effects Of Generated Pseudo Ood Utterancesmentioning

confidence: 99%

“…Recently, various deep neural network based NLU models are proposed and some of these models have been applied in real-world applications [1]- [3]. Most existing neural NLU modules are built by following a closed-world assumption [4], [5], i.e, the data used in the training and testing phrase are drawn from the same distribution. However, such an assumption is commonly violated in practical systems that are deployed in a dynamic or open environment.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Out-of-Domain Detection for Natural Language Understanding in Dialog Systems

Zheng

Chen

Huang

2020

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

Natural Language Understanding (NLU) is a vital component of dialogue systems, and its ability to detect Out-of-Domain (OOD) inputs is critical in practical applications, since the acceptance of the OOD input that is unsupported by the current system may lead to catastrophic failure. However, most existing OOD detection methods rely heavily on manually labeled OOD samples and cannot take full advantage of unlabeled data. This limits the feasibility of these models in practical applications.In this paper, we propose a novel model to generate highquality pseudo OOD samples that are akin to IN-Domain (IND) input utterances, and thereby improves the performance of OOD detection. To this end, an autoencoder is trained to map an input utterance into a latent code. and the codes of IND and OOD samples are trained to be indistinguishable by utilizing a generative adversarial network. To provide more supervision signals, an auxiliary classifier is introduced to regularize the generated OOD samples to have indistinguishable intent labels. Experiments show that these pseudo OOD samples generated by our model can be used to effectively improve OOD detection in NLU. Besides, we also demonstrate that the effectiveness of these pseudo OOD data can be further improved by efficiently utilizing unlabeled data.

show abstract

Section: E Effects Of Generated Pseudo Ood Utterancesmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Out-of-Domain Detection for Natural Language Understanding in Dialog Systems

Zheng

Chen

Huang

2020

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

show abstract

“…The main idea is that a classifier should not cover too much open space with few or no training data, thereby rejecting the unknown images. cbsSVM [10] shares the similar ideas in text classification. However, these methods are all based on SVM, which fails to effectively capture the high-level semantic concept of intents comparing with deep neural networks.…”

Section: Related Workmentioning

confidence: 94%

“…How do we detect unknown intent without any prior knowledge about it? In [9,10], a m-class classifier should be able to reject examples from unknown class while performing m-class classification tasks. It is because not all test classes are known in the training set, which forms a (m+1)-class classification problem where the (m+1) th class represents the unknown class.…”

Section: Introductionmentioning

confidence: 99%

A post-processing method for detecting unknown intent of dialogue system via pre-trained deep neural network classifier

Lin

2019

Knowledge-Based Systems

View full text Add to dashboard Cite

With the maturity and popularity of dialogue systems, detecting user's unknown intent in dialogue systems has become an important task. It is also one of the most challenging tasks since we can hardly get examples, prior knowledge or the exact numbers of unknown intents. In this paper, we propose SofterMax and deep novelty detection (SMDN), a simple yet effective post-processing method for detecting unknown intent in dialogue systems based on pre-trained deep neural network classifiers. Our method can be flexibly applied on top of any classifiers trained in deep neural networks without changing the model architecture. We calibrate the confidence of the softmax outputs to compute the calibrated confidence score (i.e., SofterMax) and use it to calculate the decision boundary for unknown intent detection. Furthermore, we feed the feature representations learned by the deep neural networks into traditional novelty detection algorithm to detect unknown intents from different perspectives. Finally, we combine the methods above to perform the joint prediction. Our method classifies examples that differ from known intents as unknown and does not require any examples or prior knowledge of it. We have conducted extensive experiments on three benchmark dialogue datasets. The results show that our method can yield significant improvements compared with the state-of-the-art baselines 1 .

show abstract

“…Although most of research papers on text classification deal with the problem of closed‐set classification, some methods have been recently proposed to tackle open‐set text classification. Fei and Liu proposed a center‐based similarity method, which is based on a decision threshold (usually 0.5) on a posteriori probability to reject an observation as unrecognized, where the probabilities are estimated from SVM scores using Platt's algorithm . Doan and Kalita proposed an algorithm called nearest class mean, which attempts to find boundary regions for known classes using spheres centered at class centroids, with observations falling outside the sphere boundaries treated as either outliers or indicators of possible new unknown classes.…”

Section: Introduction – Problem Formulationmentioning

confidence: 99%

Algorithm based on modified angle‐based outlier factor for open‐set classification of text documents

Walkowiak

Datko

Maciejewski

2018

Appl Stoch Models Bus & Ind

View full text Add to dashboard Cite

This paper presents a new method of open‐set classification of text documents, with respect to subject areas. Standard (closed‐set) approaches to text classification involve training classifiers on annotated text corpora, representing a fixed number of subject areas. Such classifiers assign a new document with unknown annotation to one of the trained classes, even if the new document is not related to any class. We propose a two‐step procedure for open‐set classification. We first use a closed‐set classifier to assign a new document to one of the known classes. Then, we evaluate the (dis)similarity between the document and the chosen class using a novel criterion of outlierness named interquartile ranged angle‐based outlierness factors, which we find effective in high‐dimensional data. Based on this, we can avoid spurious assignment of documents to unrelated subject classes. We demonstrate the feasibility of this procedure in the task of subject classification of a collection of Wikipedia documents. As compared to the standard closed‐set approach, our open‐set classifier realizes significantly better precision with only small decrease of the recall measure observed in recognition of the tested classes.

show abstract

Breaking the Closed World Assumption in Text Classification

Cited by 109 publications

References 23 publications

Out-of-Domain Detection for Natural Language Understanding in Dialog Systems

Out-of-Domain Detection for Natural Language Understanding in Dialog Systems

A post-processing method for detecting unknown intent of dialogue system via pre-trained deep neural network classifier

Algorithm based on modified angle‐based outlier factor for open‐set classification of text documents

Contact Info

Product

Resources

About