2020
DOI: 10.26686/wgtn.12493820.v1
Preprint
Genetic Programming for Evolving a Front of Interpretable Models for Data Visualization

Abstract: Data visualization is a key tool in data mining for understanding big datasets. Many visualization methods have been proposed, including the well-regarded state-of-the-art method t-distributed stochastic neighbor embedding. However, the most powerful visualization methods have a significant limitation: the manner in which they create their visualization from the original features of the dataset is completely opaque. Many domains require an understanding of the data in terms of the original features; there is h…

Cited by 9 publications (15 citation statements)
References 26 publications (37 reference statements)
“…In fact, we agree that is simplistic. However, we believe that minimizing represents one of the first baselines to compare against (and it was the only one we found being used to specifically promote interpretability [22]), and that designing a competitive baseline is non-trivial. We will investigate this further in future work.…”
Section: Discussion (mentioning)
confidence: 99%
“…The authors of [43] study whether modern model-based GP can be useful when particularly compact symbolic regression solutions are sought, to allow interpretability. A very different take to enable or improve interpretability is taken in [22,41,45], where interpretability is sought by means of feature construction and dimensionality reduction. In [22] in particular, MOGP is used, with solution size as a simple PHI.…”
Section: Related Work (mentioning)
confidence: 99%
“…For decision trees and decision rules, besides reducing the number of features, approaches exist to restrict model size, prune unnecessary parts [5,26], aggregate local models in a hierarchy [48], or promote a trade-off between accuracy and complexity by means of loss functions [30,52] or prior distributions [33,60,61]. Regarding GP (and close relatives like grammatical evolution), perhaps the simplest and most popular strategy to favor interpretability is to restrain the number of model components [16,31,57], sometimes in elaborate ways or particular settings [6,32,40,49,56]. Another strategy consists of penalizing models according to a weighted sum of the components that they include, after having pre-determined a weighting scheme [23,36].…”
Section: Related Work (mentioning)
confidence: 99%
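As a loose illustration of the size-penalty strategy described in the statement above (restraining model components, or penalizing a weighted sum of them), the sketch below scores a candidate GP expression tree by a weighted sum of its prediction error and its node count. All names and the tiny expression language are hypothetical, not taken from the cited papers:

```python
# Sketch: complexity-penalized fitness for GP-style expression trees.
# Trees are nested tuples, e.g. ('add', 'x', ('mul', 'x', 'x')).
# Illustrative only; not the method of any cited paper.

def tree_size(node):
    """Count nodes in a nested-tuple expression tree."""
    if not isinstance(node, tuple):
        return 1  # leaf: variable name or numeric constant
    return 1 + sum(tree_size(child) for child in node[1:])

def evaluate(node, x):
    """Evaluate a tiny expression language over one variable x."""
    if node == 'x':
        return x
    if not isinstance(node, tuple):
        return float(node)  # numeric constant leaf
    op, *args = node
    vals = [evaluate(a, x) for a in args]
    if op == 'add':
        return vals[0] + vals[1]
    if op == 'mul':
        return vals[0] * vals[1]
    raise ValueError(f"unknown op {op!r}")

def penalized_fitness(tree, data, alpha=0.01):
    """Weighted sum of mean squared error and model size (lower is better)."""
    mse = sum((evaluate(tree, x) - y) ** 2 for x, y in data) / len(data)
    return mse + alpha * tree_size(tree)

# Target y = x^2: the exact (but larger) tree should still beat a
# smaller linear tree as long as the size penalty alpha stays small.
data = [(x, x * x) for x in range(-3, 4)]
quadratic = ('mul', 'x', 'x')  # zero error, 3 nodes
linear = 'x'                   # 1 node, large error
print(penalized_fitness(quadratic, data))
print(penalized_fitness(linear, data))
```

A multi-objective variant (as in the MOGP approach quoted earlier) would instead keep error and size as separate objectives and evolve a Pareto front rather than collapsing them with a fixed weight alpha.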
“…Recently there have been new application perspectives for Genetic Programming (GP) regarding the increasing need for interpretable results. GP was for example used to provide interpretable policies in reinforcement learning [14], to learn manifolds [25], to create visualizations [26] or to explain complex deep learning models [10]. For dimensionality reduction tasks, GP has also been widely used as a feature construction method [35].…”
Section: Grammar-guided Genetic Programming (mentioning)
confidence: 99%