Proceedings of the 17th ACM Conference on Information and Knowledge Management 2008
DOI: 10.1145/1458082.1458117
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting temporal contexts in text classification

Abstract: Due to the increasing amount of information being stored and accessible through the Web, Automatic Document Classification (ADC) has become an important research topic. ADC usually employs a supervised learning strategy, where we first build a classification model using pre-classified documents and then use it to classify unseen documents. One major challenge in building classifiers is dealing with the temporal evolution of the characteristics of the documents and the classes to which they belong. However, mos… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2011
2011
2015
2015

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 18 publications
(11 citation statements)
references
References 34 publications
0
11
0
Order By: Relevance
“…Our proposed method transforms to estimate the probability of a given feature belonging to a category, which can be calculated both in a global view and a local view, we will explain it later. In (6), P (c j |w k ) reflects the relationship between features and categories , that is how likely a particular feature given belongs to a category, it is somewhat similar to the concept of dominance which is used in a temporal weighting function mentioned by L. Rocha et al [10]. The normal way to calculate P (c j |w k ) is:…”
Section: Proposed Methods For Improving Naive Bayes Text Classifiermentioning
confidence: 99%
“…Our proposed method transforms to estimate the probability of a given feature belonging to a category, which can be calculated both in a global view and a local view, we will explain it later. In (6), P (c j |w k ) reflects the relationship between features and categories , that is how likely a particular feature given belongs to a category, it is somewhat similar to the concept of dominance which is used in a temporal weighting function mentioned by L. Rocha et al [10]. The normal way to calculate P (c j |w k ) is:…”
Section: Proposed Methods For Improving Naive Bayes Text Classifiermentioning
confidence: 99%
“…This dataset was obtained from [19]. In that work the authors considered the first level of the taxonomy so that each document article is classified under only one category, avoiding dealing with multilabel cases.…”
Section: Textual Datasetsmentioning
confidence: 99%
“…. ., a classifier is given a new training document set, Temporally-aware classification methods for batch learning have also been proposed [21,5,28,29]: a classifier is given D * t and Dt for all t = 1, 2, . .…”
Section: Problem Definitionsmentioning
confidence: 99%
“…Instance weighting methods assume that newer instances are more essential for classifying the current instance [15,18,5,11,28,29]. Lebanon et al proposed a naive Bayes classifier with a weighting scheme for handling temporal document streams [18].…”
Section: Learning Strategiesmentioning
confidence: 99%