Aprendizado supervisionado com conjuntos de dados desbalanceados

Castro, Cristiano Leite de; Braga, Antônio de Pádua

doi:10.1590/s0103-17592011000500002

Cited by 9 publications

(9 citation statements)

References 78 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…We evaluated the predictive capability of the mechanistic models and the classification algorithms using widespread performance metrics for binary classification problems. According to Castro and Braga (2011), these criteria either focus on the detection of the minority class in unbalanced classification problems or consider the discrimination of both classes as having the same relevance. All metrics used in this assessment yield values between 0 (poor performance) and 1 (high performance).…”

Section: Resultsmentioning

confidence: 99%

See 1 more Smart Citation

Statistical Pattern Recognition for Thresholding between Human Skin and Background in Color Images

Feitosa¹,

Santos²,

Oliveira³

et al. 2017

Journal of Computer Science

View full text Add to dashboard Cite

Many research works based on the tone of human skin have been developed to locate and track the human body for the purpose of recognition in color images. With respect to other techniques, some advantages of face detection based on skin color are the smaller processing time, invariant angles of rotation and the performance in semi-occluded faces. In this study we present the results of a survey that investigated the performance of 4 supervised classifiers in skin detection. In order to maximize the generalization of the models, a training set containing samples of individuals of different ages and ethnicities was used. Experimental results showed that the best performance was achieved by using an ANN and the worst results were yielded by LDA. With the Naive Bayes, QDA and ANN algorithms, we showed that the white, black, yellow and brown tones of human skin are in a well-defined range of the RGB color spectrum determined by common characteristics. We also compiled 2798 skin samples for treatment and 305 images with their manually obtained labels as supplementary material, which was made available to help in the development of further research in human skin detection.

show abstract

Section: Resultsmentioning

confidence: 99%

“…It is calculated as the harmonic mean of the recall and precision, usually being β = 1 (12). Β is used to determine the relative importance of recall and precision (Castro and Braga, 2011 …”

Section: Metrics For Performance Evaluationmentioning

confidence: 99%

Statistical Pattern Recognition for Thresholding between Human Skin and Background in Color Images

Feitosa¹,

Santos²,

Oliveira³

et al. 2017

Journal of Computer Science

View full text Add to dashboard Cite

show abstract

“…Muitos trabalhos na área de reconhecimento de padrões tem a análise de desempenho de seus algoritmos baseada na acurácia dos seus classificadores. Entretanto, sabe-se que a acurácia pode mascarar taxas de erros de classificação quando se trabalha com classes desbalanceadas [21].…”

Section: Critério De Desempenho -áRea Abaixo Da Curva Rocunclassified

“…Os gráficos para as curvas ROC são bidimensionais, nos quais o eixo das ordenadas plota-se a sensibilidade e no eixo das abscissas a especificidade [22]. A curva ROC de um classificador ideal possui o formato da função Heaviside (Heaviside step function) [21].…”

Section: Critério De Desempenho -áRea Abaixo Da Curva Rocunclassified

Otimização do desempenho de um classificador por modificação no processo de seleção de características

Tavares¹,

Barbosa²

2015

Anais Do 12. Congresso Brasileiro De Inteligência Computacional

View full text Add to dashboard Cite

Resumo-Este trabalho apresenta um algoritmo de otimização para o processo de reconhecimento de padrões atuando diretamente na etapa de seleção de características. O algoritmo proposto é baseado em computação evolucionária e conta com o método ReliefF para seleção de características aliado ao método Support Vector Machine (SVM) para classificação. As variáveis de busca para otimização são os K-Vizinhos Mais Próximos utilizado no ReliefF e o número de características selecionadas para o classificador, ou seja, a dimensão trabalhada pelo SVM. O algoritmo proposto produziu melhorias de performance no classificador SVM. Testes foram realizados em duas bases de dados e os resultados obtidos, comparados com o resultado de outros três algoritmos, comprovam o bom desempenho do algoritmo proposto.

show abstract

“…En el caso específico de dos clases, este problema se identifica porque existe un número muy pequeño de instancias de una de las clases, en comparación con el número de instancias de la otra. En la literatura existen diferentes estudios que abordan el problema del desbalanceo entre clases, muchos proponen soluciones específicas al problema [13]- [15] y otros pocos estudian las causas del mismo [16]- [19]. Sin embargo, la conclusión general es que ante un conjunto de datos de entrenamiento desbalanceado, los algoritmos de aprendizaje tradicionales generan superficies de decisión que tienden a estar sesgadas por la clase mayoritaria, como se ilustra en la Fig.…”

Section: El Problema De Clases Desbalanceadasunclassified

A Survey on Class Imbalance Learning on Automatic Visual Inspection

Mera

Branch

2014

IEEE Latin Am. Trans.

View full text Add to dashboard Cite

The supervised machine learning has been showing very useful for the automatic visual inspection task. However, little has been considered about use traditional machine learning techniques on a domain where the classes are imbalanced. This problem corresponds to dealing with the situation where one class outnumbers the other. Traditional machine learning algorithms trained with imbalance datasets can be biased towards the majority class, thus producing poor predictive accuracy over the minority class. In this paper, we present different approaches to address the class imbalance problem and how these approaches have been used in the context of automatic visual inspection. The literature shows there are few works that consider the class imbalance problem on automatic visual inspection task and it shows that the one class classification technique is the most used.

show abstract

Aprendizado supervisionado com conjuntos de dados desbalanceados

Cited by 9 publications

References 78 publications

Statistical Pattern Recognition for Thresholding between Human Skin and Background in Color Images

Statistical Pattern Recognition for Thresholding between Human Skin and Background in Color Images

Otimização do desempenho de um classificador por modificação no processo de seleção de características

A Survey on Class Imbalance Learning on Automatic Visual Inspection

Contact Info

Product

Resources

About