Simple and interpretable data-driven descriptor accurately predicts the synthesizability of single and double perovskites.
The lack of reliable methods for identifying descriptors -the sets of parameters capturing the underlying mechanisms of a materials property -is one of the key factors hindering efficient materials development. Here, we propose a systematic approach for discovering descriptors for materials properties, within the framework of compressed-sensing based dimensionality reduction. SISSO (sure independence screening and sparsifying operator) tackles immense and correlated features spaces, and converges to the optimal solution from a combination of features relevant to the materials' property of interest. In addition, SISSO gives stable results also with small training sets. The methodology is benchmarked with the quantitative prediction of the ground-state enthalpies of octet binary materials (using ab initio data) and applied to the showcase example of predicting the metal/insulator classification of binaries (with experimental data). Accurate, predictive models are found in both cases. For the metal-insulator classification model, the predictive capability are tested beyond the training data: It rediscovers the available pressure-induced insulator→metal transitions and it allows for the prediction of yet unknown transition candidates, ripe for experimental validation. As a step forward with respect to previous model-identification methods, SISSO can become an effective tool for automatic materials development.
Understanding Ostwald ripening and disintegration of supported metal particles under operating conditions has been of central importance in the study of sintering and dispersion of heterogeneous catalysts for long-term industrial implementation. To achieve a quantitative description of these complicated processes, an atomistic and generic theory taking into account the reaction environment, particle size and morphology, and metal− support interaction is developed. It includes (1) energetics of supported metal particles, (2) formation of monomers (both the metal adatoms and metal−reactant complexes) on supports, and (3) corresponding sintering rate equations and total activation energies, in the presence of reactants at arbitrary temperature and pressure. The thermodynamic criteria for the reactant assisted Ostwald ripening and induced disintegration are formulated, and the influence of reactants on sintering kinetics and redispersion are mapped out. Most energetics and kinetics barriers in the theory can be obtained conveniently by first-principles theory calculations. This allows for the rapid exploration of sintering and disintegration of supported metal particles in huge phase space of structures and compositions under various reaction environments. General strategies of suppressing the sintering of the supported metal particles and facilitating the redispersions of the low surface area catalysts are proposed. The theory is applied to TiO 2 (110) supported Rh particles in the presence of carbon monoxide, and reproduces well the broad temperature, pressure, and particle size range over which the sintering and redispersion occurred in such experiments. The result also highlights the importance of the metal−carbonyl complexes as monomers for Ostwald ripening and disintegration of supported metal catalysts in the presence of CO.
The availability of big data in materials science offers new routes for analyzing materials properties and functions and achieving scientific understanding. Finding structure in these data that is not directly visible by standard tools and exploitation of the scientific information requires new and dedicated methodology based on approaches from statistical learning, compressed sensing, and other recent methods from applied mathematics, computer science, statistics, signal processing, and information science. In this paper, we explain and demonstrate a compressed-sensing based methodology for feature selection, specifically for discovering physical descriptors, i.e., physical parameters that describe the material and its properties of interest, and associated equations that explicitly and quantitatively describe those relevant properties. As showcase application and proof of concept, we describe how to build a physical model for the quantitative prediction of the crystal structure of binary compound semiconductors.
The identification of descriptors of materials properties and functions that capture the underlying physical mechanisms is a critical goal in data-driven materials science. Only such descriptors will enable a trustful and efficient scanning of materials spaces and possibly the discovery of new materials. Recently, the sure-independence screening and sparsifying operator (SISSO) has been introduced and was successfully applied to a number of materials-science problems. SISSO is a compressed sensing based methodology yielding predictive models that are expressed in form of analytical formulas, built from simple physical properties. These formulas are systematically selected from an immense number (billions or more) of candidates. In this work, we describe a powerful extension of the methodology to a 'multi-task learning' approach, which identifies a single descriptor capturing multiple target materials properties at the same time. This approach is specifically suited for a heterogeneous materials database with scarce or partial data, e.g. in which not all properties are reported for all materials in the training set. As showcase examples, we address the construction of materials properties maps for the relative stability of octet-binary compounds, considering several crystal phases simultaneously, and the metal/insulator classification of binary materials distributed over many crystal prototypes.
Monthly observed wind speed data at 597 weather stations and NCEP wind speed data at 10 m above surface were used to explore the temporal variations of the wind speed for 1961-2007 in China. The results indicate that the temporal variation of annual wind speed in China has experienced four phases: two relatively steady periods from 1961 to 1968 and 1969 to 1974 with a sharp step change in 1969, a statistically significant decline stage from 1974 to 1990s, and another relatively steady period from 1990s to 2007. Except for the sharp step in 1969 being caused by the changes of observation instrument, other breakpoints correspond well with the positive and negative phases of the interdecadal Pacific oscillation. In addition, four different temporal variation patterns of annual wind speed in China have been identified by using cluster analysis and their spatial distributions were also explored.
Single-atom-alloy catalysts (SAACs) have recently become a frontier in catalysis research. Simultaneous optimization of reactants’ facile dissociation and a balanced strength of intermediates’ binding make them highly efficient catalysts for several industrially important reactions. However, discovery of new SAACs is hindered by lack of fast yet reliable prediction of catalytic properties of the large number of candidates. We address this problem by applying a compressed-sensing data-analytics approach parameterized with density-functional inputs. Besides consistently predicting efficiency of the experimentally studied SAACs, we identify more than 200 yet unreported promising candidates. Some of these candidates are more stable and efficient than the reported ones. We have also introduced a novel approach to a qualitative analysis of complex symbolic regression models based on the data-mining method subgroup discovery. Our study demonstrates the importance of data analytics for avoiding bias in catalysis design, and provides a recipe for finding best SAACs for various applications.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.