Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP 2015)
DOI: 10.18653/v1/d15-1282
Social Media Text Classification under Negative Covariate Shift

Abstract: In a typical social media content analysis task, the user is interested in analyzing posts of a particular topic. Identifying such posts is often formulated as a classification problem. However, this problem is challenging. One key issue is covariate shift. That is, the training data is not fully representative of the test data. We observed that the covariate shift mainly occurs in the negative data because topics discussed in social media are highly diverse and numerous, but the user-labeled negative training…

Cited by 32 publications (42 citation statements)
References 39 publications
“…Second, it only focuses on improving accuracy on the classes with seed examples. Our work in [14] dealt with the problem using an entirely different approach adopted from [13]. However, these works did not propose or deal with cumulative learning, which is important for an intelligent system as it allows the system to learn more and more and become more and more knowledgeable.…”
Section: Open World Classification
confidence: 99%
“…At time t+1, the new dataset D_{t+1} of class c_{t+1} arrives, and the classification model needs to be updated or extended to produce a new classification model M_{t+1}. We note that each h_i in M_t or M_{t+1} is a 1-vs-rest SVM classifier built using the CBS learning method in [13] for class c_i, treating c_i as the positive class. We will discuss CBS learning in the next section.…”
Section: Training a Cumulative Classification Model
confidence: 99%
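The quoted passage describes maintaining one 1-vs-rest SVM per class and extending the model when data for a new class arrives. A minimal sketch of that scheme, using scikit-learn's LinearSVC as a stand-in for the CBS learner of [13] and entirely hypothetical toy data:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.svm import LinearSVC
import numpy as np

# Hypothetical toy corpus: classes seen up to time t.
docs_by_class = {
    "sports": ["the team won the game", "a great goal in the match"],
    "politics": ["the senate passed the bill", "election results were announced"],
}

def train_binary(vectorizer, docs_by_class, positive_class):
    """1-vs-rest classifier h_i: positive_class vs. everything else."""
    pairs = [(d, c) for c, ds in docs_by_class.items() for d in ds]
    X = vectorizer.transform([d for d, _ in pairs])
    y = np.array([1 if c == positive_class else 0 for _, c in pairs])
    return LinearSVC(C=1.0).fit(X, y)

# Build the model M_t = {h_i} at time t.
all_docs = [d for ds in docs_by_class.values() for d in ds]
vectorizer = TfidfVectorizer().fit(all_docs)
model = {c: train_binary(vectorizer, docs_by_class, c) for c in docs_by_class}

# Time t+1: data for a new class arrives; extend M_t to M_{t+1} with one
# more 1-vs-rest classifier (here the vectorizer is also refit so the
# new class's vocabulary is represented).
docs_by_class["tech"] = ["a new smartphone was released", "the update fixes bugs"]
all_docs = [d for ds in docs_by_class.values() for d in ds]
vectorizer = TfidfVectorizer().fit(all_docs)
model = {c: train_binary(vectorizer, docs_by_class, c) for c in docs_by_class}

def predict(model, vectorizer, doc):
    """Assign the class whose 1-vs-rest classifier is most confident."""
    x = vectorizer.transform([doc])
    return max(model, key=lambda c: model[c].decision_function(x)[0])
```

This sketch retrains every h_i for simplicity; a truly cumulative system, as the quote notes, would keep the existing classifiers and only train the new one.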
“…The authors in [44] reported that a linear SVM consistently achieves the best results compared to SVMs with other kernels, including SVM-Poly. The authors in [7,16,26] also reported the efficiency of linear SVMs for binary text classification. Unfortunately, manual hyperparameter selection remains a practical issue, and recent literature still provides no heuristic rules or rules of thumb for this task [41].…”
Section: Introduction
confidence: 99%
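Where no heuristic exists for choosing the hyperparameters of a linear SVM, cross-validated search is a common fallback. A minimal sketch of tuning the regularization strength C for binary text classification, with hypothetical toy data (scikit-learn assumed):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy data for binary text classification.
texts = [
    "the team won the game", "a great goal in the match",
    "an exciting final at the stadium", "the coach praised the players",
    "the senate passed the bill", "election results were announced",
    "the minister gave a speech", "parliament debated the new law",
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]  # 1 = sports, 0 = politics

# In place of manual tuning, search the regularization strength C
# by cross-validation over a TF-IDF + linear SVM pipeline.
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("svm", LinearSVC()),
])
search = GridSearchCV(pipeline, {"svm__C": [0.01, 0.1, 1.0, 10.0]}, cv=2)
search.fit(texts, labels)

best_C = search.best_params_["svm__C"]
pred = search.predict(["the players won the final"])
```

The grid of C values here is illustrative; in practice the search range would be chosen per dataset, which is exactly the gap in heuristics the quoted passage points out.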