2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA) 2017
DOI: 10.1109/icmla.2017.0-134
|View full text |Cite
|
Sign up to set email alerts
|

HDLTex: Hierarchical Deep Learning for Text Classification

Abstract: Increasingly large document collections require improved information processing methods for searching, retrieving, and organizing text. Central to these information processing methods is document classification, which has become an important application for supervised learning. Recently the performance of traditional supervised classifiers has degraded as the number of documents has increased. This is because along with growth in the number of documents has come an increase in the number of categories. This pa… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
229
0
3

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 317 publications
(233 citation statements)
references
References 35 publications
1
229
0
3
Order By: Relevance
“…• Web of Science (WOS): This dataset was used in two previous works on hierarchical text classification (Kowsari et al, 2017;Sinha et al, 2018). It contains 134 topics, split across 7 parent categories.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…• Web of Science (WOS): This dataset was used in two previous works on hierarchical text classification (Kowsari et al, 2017;Sinha et al, 2018). It contains 134 topics, split across 7 parent categories.…”
Section: Methodsmentioning
confidence: 99%
“…Thus, adding or removing any label requires changing the model architecture. Second, while it is possible to retain some model parameters, such as in hierarchical classification models, these architectures must still learn separate weights for every new class or sub-class (Cai and Hofmann, 2004;Kowsari et al, 2017). This is problematic because the new class labels often come with very few training examples, providing insufficient information for learning accurate model weights.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Text classification problems have been widely studied and addressed in many real applications [1][2][3][4][5][6][7][8] over the last few decades. Especially with recent breakthroughs in Natural Language Processing (NLP) and text mining, many researchers are now interested in developing applications that leverage text classification methods.…”
Section: Introductionmentioning
confidence: 99%
“…The collection and curation of large repositories of information has led to significant advancements in the training of machine learning approaches in areas as diverse as image recognition [42,89], textual analysis [47,63,98], and speech recognition [45,77,83,102]. In particular, in the realm of biomedical sciences, large relatively mature collections of information have been assembled covering areas such as genes and proteins [12,58], biological processes and pathways [5,57,71], drugs [1,46,76], and diseases [53,80,85].…”
Section: Introductionmentioning
confidence: 99%