Boosting to correct inductive bias in text classification

Liu, Yan; Yang, Yiming; Carbonell, Jaime G.

doi:10.1145/584792.584850

Cited by 26 publications

(13 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…DML is a technique used to identify a suitable distance metric based on the data projection that can be divided into four families [30]. The first two families are based on the supervision of the method: supervised and unsupervised DML.…”

Section: Derma: Melanoma Diagnosis Based On Collaborative Multilabmentioning

confidence: 99%

DERMA: A Melanoma Diagnosis Platform Based on Collaborative Multilabel Analog Reasoning

Nicolás-Sans

Fornells

Golobardes

et al. 2014

The Scientific World Journal

View full text Add to dashboard Cite

The number of melanoma cancer-related death has increased over the last few years due to the new solar habits. Early diagnosis has become the best prevention method. This work presents a melanoma diagnosis architecture based on the collaboration of several multilabel case-based reasoning subsystems called DERMA. The system has to face up several challenges that include data characterization, pattern matching, reliable diagnosis, and self-explanation capabilities. Experiments using subsystems specialized in confocal and dermoscopy images have provided promising results for helping experts to assess melanoma diagnosis.

show abstract

Section: Derma: Melanoma Diagnosis Based On Collaborative Multilabmentioning

confidence: 99%

DERMA: A Melanoma Diagnosis Platform Based on Collaborative Multilabel Analog Reasoning

Nicolás-Sans

Fornells

Golobardes

et al. 2014

The Scientific World Journal

View full text Add to dashboard Cite

show abstract

“…Possible learning methods include regression models, nearest neighbor classifiers, decision trees, Bayesian probabilistic classifiers, inductive rule learning algorithms, neural networks, online learning approaches, support vector machines, genetic programming techniques, and many hybrid methods. Instead of fixating on a single classification technique, the research will explore ensemble approaches that combine different techniques, such as bagging [2], boosting [5,21] and staged approaches.…”

Section: Supporting Multi-dimensional Design Explorationmentioning

confidence: 99%

Supporting case-based design for packaged software implementations

Cao

2008

2008 12th International Conference on Computer Supported Cooperative Work in Design

View full text Add to dashboard Cite

Design in packaged software implementation (PSI) is the process to solve business problems by customizing and integrating the off-the-shelf software package. PSI experts frequently practice case-based design (CBD) when facing a new problem situation: explore the past design cases, find a similar case, and reuse the design for that case in the new problem situation. The success of CBD depends on a continuous cycle of knowledge creation. This paper presents a theoretical framework of case-based design as an organizational knowledge creation process. Based on this framework, the research proposes an innovative tool to support CBD in packaged software implementations. The fundamental belief is that by utilizing the collective power of a large group of people, better designs can be achieved at lower costs with lower risks.

show abstract

“…Centroid-based algorithm [1] is a commonly used method for text categorization due to the simplicity and linearity. But it often suffers from the inductive bias [2] or model misfit [3] and researches have proposed some methods to further adjust the centroids to make the centroidbased algorithm perform better.…”

Section: Introductionmentioning

confidence: 99%

A new algorithm based on centroid for text categorization

Shen

2012

2012 9th International Conference on Fuzzy Systems and Knowledge Discovery

View full text Add to dashboard Cite

Text categorization is a hot topic and a key technology in data mining and information retrieval, so that it received wide attention recently. Centroid-based algorithm is an effective and robust approach. However it often suffers from the inductive bias or model misfit. In order to solve this problem, many researchers have put forward a number of improvement strategies which makes the centroid-based algorithm have a better performance. The paper proposed a novel approach to adjust the centroids which is called Weighted Margin adjusted Centroid based Algorithm (WMCA). Then it presented a lot of experimental comparison with some other algorithms by using 5 different public corpuses. The results showed that the WMCA algorithm has the best performance.

show abstract

Boosting to correct inductive bias in text classification

Cited by 26 publications

References 21 publications

DERMA: A Melanoma Diagnosis Platform Based on Collaborative Multilabel Analog Reasoning

DERMA: A Melanoma Diagnosis Platform Based on Collaborative Multilabel Analog Reasoning

Supporting case-based design for packaged software implementations

A new algorithm based on centroid for text categorization

Contact Info

Product

Resources

About