2022
DOI: 10.1021/acs.chemmater.2c01333
|View full text |Cite
|
Sign up to set email alerts
|

A Strategic Approach to Machine Learning for Material Science: How to Tackle Real-World Challenges and Avoid Pitfalls

Abstract: The exponential growth and success of machine learning (ML) has resulted in its application in all scientific domains including material science. Advancement in experimental techniques has led to an increase in the volume of material science data encouraging material scientists to investigate data-driven solutions to scientific problems. While the resources available to get started with ML are ever increasing, there is little literature on traversing through the space of decisions that need to be made to imple… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 45 publications
0
10
0
Order By: Relevance
“…Machine learning and deep learning approaches are increasingly being applied to spectroscopy data and material characterization. We have previously reported on a deep learning approach that we applied to the study of complex IR spectroscopy data, in which we trained an artificial neural network β-variational autoencoder (β-VAE) on a data set of 25,000 IR spectra of PEX-a pipes . β-VAEs are deep generative models capable of learning disentangled (independent and interpretable) representations of the generative factors responsible for variance in data. ,, For PEX-a pipe IR spectra, the physicochemical processes that can occur during pipe aging, degradation, and cracking are the underlying generative factors responsible for the variance in the spectra .…”
Section: Introductionmentioning
confidence: 99%
“…Machine learning and deep learning approaches are increasingly being applied to spectroscopy data and material characterization. We have previously reported on a deep learning approach that we applied to the study of complex IR spectroscopy data, in which we trained an artificial neural network β-variational autoencoder (β-VAE) on a data set of 25,000 IR spectra of PEX-a pipes . β-VAEs are deep generative models capable of learning disentangled (independent and interpretable) representations of the generative factors responsible for variance in data. ,, For PEX-a pipe IR spectra, the physicochemical processes that can occur during pipe aging, degradation, and cracking are the underlying generative factors responsible for the variance in the spectra .…”
Section: Introductionmentioning
confidence: 99%
“…During the training process, the model is evaluated on the validation set at regular intervals to monitor its performance and detect any signs of overfitting. If the model’s performance on the validation set starts to degrade while its performance on the training set continues to improve, this is a sign of overfitting, and the training process can be stopped [ 27 ]. The slightest validation error is indeed a good criterion to stop training the network.…”
Section: Methodsmentioning
confidence: 99%
“…Quantitative structure–retention relationships (QSRRs) represent the theoretical description of chromatographic retention behavior using physicochemical properties derived from the chemical structure of analytes and from the effect of chromatographic conditions [ 21 , 22 , 23 , 24 , 25 , 26 , 27 ]. A method of optimization to represent the correct geometry of each analyte is required to provide data for the calculations of molecular descriptors.…”
Section: Introductionmentioning
confidence: 99%
“…It has already contributed significantly to the prediction of the properties of materials and accelerated the development of molecular systems with specified properties. Machine learning algorithms are capable of analyzing complex data sets and uncovering hidden patterns that have greatly assisted researchers in understanding and designing new materials . The integration of experimental data with ML techniques has the potential to revolutionize the field of materials science, enabling scientists to make informed decisions and drive innovation in materials research.…”
mentioning
confidence: 99%