2017
DOI: 10.1007/978-3-319-62410-5_16
|View full text |Cite
|
Sign up to set email alerts
|

Predicting the risk of suffering chronic social exclusion with machine learning

Abstract: The fight against social exclusion is at the heart of the Europe 2020 strategy: 120 million people are at risk of suffering this condition in the EU. Risk prediction models are widely used in insurance companies and health services. However, the use of these models to allow an early detection of social exclusion by social workers is not a common practice. This paper describes a data analysis of over 16K cases with over 60 predictors from the Spanish region of Castilla y León. The use of machine learning paradi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 4 publications
0
3
0
Order By: Relevance
“…Although the architecture has not been implemented, many of the ideas and proposals have been put into practice for the generation of an online social exclusion prediction service in the Spanish region of Castilla y Le´on [17]. Future work includes: implementing the architecture in a multi-agent plat form; extending the decision support system for the labeling service; considering different machine learning problems as multi-instance learning and multi-label learning; and, a better exploitation of the ontologies and semantic resources to include forms of advanced learning such as case-based reasoning, transfer learn ing, and graph mining.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Although the architecture has not been implemented, many of the ideas and proposals have been put into practice for the generation of an online social exclusion prediction service in the Spanish region of Castilla y Le´on [17]. Future work includes: implementing the architecture in a multi-agent plat form; extending the decision support system for the labeling service; considering different machine learning problems as multi-instance learning and multi-label learning; and, a better exploitation of the ontologies and semantic resources to include forms of advanced learning such as case-based reasoning, transfer learn ing, and graph mining.…”
Section: Discussionmentioning
confidence: 99%
“…According to the number of variables and their possible values, the Layer 2 agents may report for their study: a hierarchical clustering dendrogram to compare the similarity between the representatives (so that if one receives a label, the most similar cases could, but not necessarily, receive similar labels); multi-dimensional visualizations (such as Chernoff faces or Star diagrams); plots of the two or three most important variables obtained after a dimensionality reduction process if they are reasonably representative of all dimensions considered, etcetera. In a prediction service of the chronicity of social exclusion built with the ideas presented here [17], 63 variables were considered, hence labeling a few cases was very complex because of the number of dimensions to be studied.…”
Section: *^\ /mentioning
confidence: 99%
“…There is no universal rule to define the percentages of cases assigned to training and validation. Here we follow previous work and Pareto’s Principle ( Pareto, 1896 ; Dunford et al, 2014 ; Serrano et al, 2017 ) and use 80% of positives for training and 20% for validation. A larger percentage in validation may lose too much information needed for learning, but a smaller percentage may give an improper confidence because of the variability associated to a small amount of cases.…”
Section: Methodsmentioning
confidence: 99%