Inductive learning algorithms and representations for text categorization

Dumais, Susan T.; Platt, John; Heckerman, David; Sahami, Mehran

doi:10.1145/288627.288651

Cited by 1,065 publications

(643 citation statements)

References 25 publications

“…A wide variety of learning approaches have been applied to TC, to name a few, Bayesian classification (Lewis and Ringuette 1994;Domingo and Pazzani 1996;Larkey and Croft 1996;Koller and Sahami 1997;Lewis 1998), decision trees (Weiss, Apte et al ;Fuhr and Buckley 1991;Cohen and Hirsh 1998;Li and Jain 1998), decision rule classifiers such as CHARADE (Moulinier and Ganascia 1996), or DL-ESC (Li and Yamanishi 1999), or RIPPER (Cohen and Hirsh 1998), or SCAR (Moulinier, Raskinis et al 1996), or SCAP-1 (Apté, Damerau et al 1994), multi-linear regression models (Yang and Chute 1994;Yang and Liu 1999), Rocchio method (Hull 1994;Ittner, Lewis et al 1995;Sable and Hatzivassiloglou 2000), Neural Networks (Schütze, Hull et al 1995;Wiener, Pedersen et al 1995;Dagan, Karov et al 1997;Ng, Goh et al 1997;Lam and Lee 1999;Ruiz and Srinivasan 1999), example based classifiers (Creecy 1991;Masand, Linoff et al 1992;Larkey 1999), support vector machines (Joachims 1998), Bayesian inference networks (Tzeras and Hartmann 1993;Wai and Fan 1997;Dumais, Platt et al 1998), genetic algorithms (Masand 1994;Clack, Farringdon et al 1997), and maximum entropy modelling (Manning and Schütze 1999).…”

Section: Machine Learning Approaches To Text Categorization (mentioning)

confidence: 99%

“…Because this process is highly domain dependent and considering all possible combinations of tokens is impossible, many algorithms exist to define phrasal indexes. Although some researchers have reported an improvement in classification accuracy when using such indexes (depending on the quality of the generated phrases), a number of experimental results (Apté, Damerau et al 1994; Dumais, Platt et al 1998) have not been uniformly encouraging, irrespective of whether the notion of "phrase" is motivated (i) syntactically, i.e. the phrase is such according to the grammar of the language; or (ii) statistically, i.e.…”

Section: Indexing (mentioning)

confidence: 99%

“…Among these are the DIA association factor (Fuhr and Buckley 1991), chi-square (Yang and Pedersen 1997;Sebastiani, Sperduti et al 2000;Caropreso, Matwin et al 2001), NGL coefficient (Ng, Goh et al 1997;Ruiz and Srinivasan 1999), information gain (Lewis and Ringuette 1994;Moulinier, Raskinis et al 1996;Yang and Pedersen 1997;Larkey 1998;Mladenic and Grobelnik 1998;Caropreso, Matwin et al 2001), mutual information (Larkey and Croft 1996;Wai and Fan 1997;Dumais, Platt et al 1998;Taira and Haruno 1999), odds ratio (Mladenic and Grobelnik 1998;Ruiz and Srinivasan 1999;Caropreso, Matwin et al 2001), relevancy score (Wiener, Pedersen et al 1995) and GSS coefficient (Galavotti, Sebastiani et al 2000). Three of the most popular methods are described briefly below.…”

Section: This Leads To the Term Frequency/inverse Document Frequency (mentioning)

confidence: 99%

Text and Hypertext Categorization

Benbrahim; Bramer (2009). Lecture Notes in Computer Science

Automatic categorization of text documents has become an important area of research in the last two decades, with features that make it significantly more difficult than the traditional classification tasks studied in machine learning. A more recent development is the need to classify hypertext documents, most notably web pages. These have features that add further complexity to the categorization task but also offer the possibility of using information that is not available in standard text classification, such as metadata and the content of the web pages that point to and are pointed at by a web page of interest. This chapter surveys the state of the art in text categorization and hypertext categorization, focussing particularly on issues of representation that differentiate them from 'conventional' classification tasks and from each other.

“…In the late '90s, Machine Learning techniques were successfully applied to Text Classification. Support Vector Machines were applied to Text Classification in [6,4]. Maximum Entropy Models were also applied in [8].…”

Section: Related Work (mentioning)

confidence: 99%

Multi-topic Aspects in Clinical Text Classification

Towfic; Dobbs; et al. (2007). 2007 IEEE International Conference on Bioinformatics and Biomedicine (BIBM 2007)

“…Following the previous works [14,15,10], we build binary classifiers for top ten most populous categories. In our experiment, stop words were not eliminated, and title words were not distinguished with body words.…”

Section: Reuters 21587 Text Categorization Test Collection (mentioning)

confidence: 99%

Multinomial Event Model Based Abstraction for Sequence and Text Classification

Kang; Zhang; Silvescu; et al. (2005). Lecture Notes in Computer Science

In many machine learning applications that deal with sequences, there is a need for learning algorithms that can effectively utilize the hierarchical grouping of words. We introduce Word Taxonomy guided Naive Bayes Learner for the Multinomial Event Model (WTNBL-MN) that exploits word taxonomy to generate compact classifiers, and Word Taxonomy Learner (WTL) for automated construction of word taxonomy from sequence data. WTNBL-MN is a generalization of the Naive Bayes learner for the Multinomial Event Model for learning classifiers from data using word taxonomy. WTL uses hierarchical agglomerative clustering to cluster words based on the distribution of class labels that co-occur with the word counts. Our experimental results on protein localization sequences and Reuters text show that the proposed algorithms can generate Naive Bayes classifiers that are more compact and similar or often more accurate than those produced by standard Naive Bayes learner for the Multinomial Model.

Disciplines: Artificial Intelligence and Robotics
