2023
DOI: 10.1038/s41598-023-28328-2
|View full text |Cite
|
Sign up to set email alerts
|

A computational framework for physics-informed symbolic regression with straightforward integration of domain knowledge

Abstract: Discovering a meaningful symbolic expression that explains experimental data is a fundamental challenge in many scientific fields. We present a novel, open-source computational framework called Scientist-Machine Equation Detector (SciMED), which integrates scientific discipline wisdom in a scientist-in-the-loop approach, with state-of-the-art symbolic regression (SR) methods. SciMED combines a wrapper selection method, that is based on a genetic algorithm, with automatic machine learning and two levels of SR m… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 43 publications
(22 citation statements)
references
References 100 publications
(89 reference statements)
0
19
0
Order By: Relevance
“…It could be argued that we could have tackled the physical units validity of expressions in SR by using the Buckingham Π theorem (Buckingham 1914), with vari-ables and constants rendered dimensionless by means of multiplicative operations amongst them (such an approach has recently been proposed by Matchev et al 2022;Keren et al 2023). However, working on so called Π groups can make SR much more difficult in practice as it makes the dimensionless solutions often more complex.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…It could be argued that we could have tackled the physical units validity of expressions in SR by using the Buckingham Π theorem (Buckingham 1914), with vari-ables and constants rendered dimensionless by means of multiplicative operations amongst them (such an approach has recently been proposed by Matchev et al 2022;Keren et al 2023). However, working on so called Π groups can make SR much more difficult in practice as it makes the dimensionless solutions often more complex.…”
Section: Discussionmentioning
confidence: 99%
“…SR has traditionally been tackled using genetic algorithms where a population of candidate mathematical expressions are iteratively improved through operations inspired by natural evolution such as natural selection, crossover, and mutation. This type of approach includes the well known Eureqa software (Schmidt & Lipson 2009) (see Graham et al 2013 for a benchmark of Eureqa's capabilities on astrophysical test cases), as well as more recent works (Cranmer 2020;Virgolin & Bosman 2022;Stephens 2015;Kommenda et al 2020;Keren et al 2023). In addition, SR has been implemented using various methods ranging from brute force to (un-)guided Monte-Carlo, all the way to probabilistic searches (Mc-Conaghy 2011;Kammerer et al 2020;Bartlett et al 2022;Brence et al 2021;Jin et al 2019), as well as through problem simplification algorithms (Luo et al 2017;Tohme et al 2022).…”
Section: Related Work -A Brief Survey Of Modern Symbolic Regressionmentioning
confidence: 99%
See 1 more Smart Citation
“…In the present study, we develop a foundational symbolic embedding for physics that enables the entire expression tree graph to be tackled, as well as local units constraints. Unlike previous attempts to consider units in which data sets were rendered dimensionless before applying standard SR techniques (Udrescu & Tegmark 2020;Matchev et al 2022;Keren et al 2023), our approach allows us to anticipate the required units for the subsequent symbol to be generated in a partially composed mathematical expression. By adopting this approach, we not only focus on training a neural network to generate increasingly precise expressions, as in Petersen et al (2021a), but we also generate labels of the necessary units and actively train our neural network to adhere to such constraints.…”
Section: Physical Srmentioning
confidence: 99%
“…Indeed, it could be argued that we could have tackled physical units validity of expressions in SR by taking advantage of the Buckingham Π theorem (Buckingham 1914), with variables and constants rendered dimensionless by means of multiplicative operations among them. Such an approach can actually be adopted as a preliminary step in conjunction with any SR framework (see, e.g., Matchev et al 2022;Keren et al 2023). However, although working with so-called Π groups ensures the generation of physically valid expressions (since all terms become dimensionless), it simultaneously removes constraints imposed by dimensional analysis, complicating the SR process.…”
Section: Rules Of Units Requirementsmentioning
confidence: 99%