2017
DOI: 10.1088/1367-2630/aa57c2
|View full text |Cite
|
Sign up to set email alerts
|

Uncovering structure-property relationships of materials by subgroup discovery

Abstract: Subgroup discovery (SGD) is presented here as a data-mining approach to help find interpretable local patterns, correlations, and descriptors of a target property in materials-science data. Specifically, we will be concerned with data generated by density-functional theory calculations. At first, we demonstrate that SGD can identify physically meaningful models that classify the crystal structures of 82 octet binary (OB) semiconductors as either rocksalt or zincblende. SGD identifies an interpretable two-dimen… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
74
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
8
2

Relationship

4
6

Authors

Journals

citations
Cited by 94 publications
(77 citation statements)
references
References 101 publications
0
74
0
1
Order By: Relevance
“…These include the Δ-approach to ML, that in order to increase prediction accuracy, it uses as the learning target the difference between a lower-quality model to the property of interest [448]. Another technique is the subgroup discovery, which finds local structure in data, as opposed to mapping a unique global relationship [449]. And the recent multi-fidelity learning which aims to be applied to small datasets, where in order to enhance the sampling and therefore learning capacity, one can combine lower precision data to overcome the scarcity of higher precision data [450].…”
Section: Discovery Energies and Stabilitymentioning
confidence: 99%
“…These include the Δ-approach to ML, that in order to increase prediction accuracy, it uses as the learning target the difference between a lower-quality model to the property of interest [448]. Another technique is the subgroup discovery, which finds local structure in data, as opposed to mapping a unique global relationship [449]. And the recent multi-fidelity learning which aims to be applied to small datasets, where in order to enhance the sampling and therefore learning capacity, one can combine lower precision data to overcome the scarcity of higher precision data [450].…”
Section: Discovery Energies and Stabilitymentioning
confidence: 99%
“…They concern topics like crystal-structure prediction, property prediction, error estimates, classification of materials, and more. Some of these notebooks correspond to peer-reviewed publications [23][24][25][26][27][28][29][30][31][32][33][34].…”
Section: The Nomad Laboratory Conceptmentioning
confidence: 99%
“…The target of interest is the energy difference δ E between rocksalt and zincblende crystal structures. It is still an open challenge to find that combination of physical properties that can fully explain δ E [5], [6]. For causal inference, it suffices to know that the energy difference is influenced by the descriptor variables and not the other way round.…”
Section: Case Study: Octet Binary Semi Conductorsmentioning
confidence: 99%