Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing 2017
DOI: 10.18653/v1/d17-1314
|View full text |Cite
|
Sign up to set email alerts
|

DOC: Deep Open Classification of Text Documents

Abstract: Abstract

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
227
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 242 publications
(227 citation statements)
references
References 25 publications
0
227
0
Order By: Relevance
“…Note that small perturbations of each input are added to the last feature layer in this baseline. 4) DOC [28]: m binary classifiers are built for m classes.…”
Section: Baselinesmentioning
confidence: 99%
“…Note that small perturbations of each input are added to the last feature layer in this baseline. 4) DOC [28]: m binary classifiers are built for m classes.…”
Section: Baselinesmentioning
confidence: 99%
“…Text classification tasks in real-world applications often consists of 2 components-In-Doman (ID) classification and Out-of-Domain (OOD) detection components Kim and Kim, 2018;Shu et al, 2017;Shamekhi et al, 2018). ID classification refers to classifying a user's input with a label that exists in the training data, and OOD detection refers to designate a special OOD tag to the input when it does not belong to any of the labels in the ID training dataset (Dai et al, 2007).…”
Section: Introductionmentioning
confidence: 99%
“…We used the one-vs-rest logistic regression instead of the multinomial logistic regression in order to obtain a probability cutoff of 0.5 to determine the unknown cell type. DOC was an advanced machine learning method for classifying unseen text documents, which was inherently similar to our problem and could be directly applied here 38 . The key idea of DOC was to find a data-driven probability cutoff for the unknown class rather than using a fixed probability cutoff of 0.5 as LR did.…”
Section: Comparison Approachesmentioning
confidence: 99%