The discovery of novel materials and functional molecules can help to solve some of society’s most urgent challenges, ranging from efficient energy harvesting and storage to uncovering novel pharmaceutical drug candidates. Traditionally matter engineering–generally denoted as inverse design–was based massively on human intuition and high-throughput virtual screening. The last few years have seen the emergence of significant interest in computer-inspired designs based on evolutionary or deep learning methods. The major challenge here is that the standard strings molecular representation SMILES shows substantial weaknesses in that task because large fractions of strings do not correspond to valid molecules. Here, we solve this problem at a fundamental level and introduce SELFIES (SELF-referencIng Embedded Strings), a string-based representation of molecules which is 100% robust. Every SELFIES string corresponds to a valid molecule, and SELFIES can represent every molecule. SELFIES can be directly applied in arbitrary machine learning models without the adaptation of the models; each of the generated molecule candidates is valid. In our experiments, the model’s internal memory stores two orders of magnitude more diverse molecules than a similar test with SMILES. Furthermore, as all molecules are valid, it allows for explanation and interpretation of the internal working of the generative models.
Discovering and optimizing commercially viable materials for clean energy applications typically takes more than a decade. Self-driving laboratories that iteratively design, execute, and learn from materials science experiments in a fully autonomous loop present an opportunity to accelerate this research process. We report here a modular robotic platform driven by a model-based optimization algorithm capable of autonomously optimizing the optical and electronic properties of thin-film materials by modifying the film composition and processing conditions. We demonstrate the power of this platform by using it to maximize the hole mobility of organic hole transport materials commonly used in perovskite solar cells and consumer electronics. This demonstration highlights the possibilities of using autonomous laboratories to discover organic and inorganic materials relevant to materials sciences and clean energy technologies.
We
report Phoenics, a probabilistic global optimization algorithm
identifying the set of conditions of an experimental or computational
procedure which satisfies desired targets. Phoenics combines ideas
from Bayesian optimization with concepts from Bayesian kernel density
estimation. As such, Phoenics allows to tackle typical optimization
problems in chemistry for which objective evaluations are limited,
due to either budgeted resources or time-consuming evaluations of
the conditions, including experimentation or enduring computations.
Phoenics proposes new conditions based on all previous observations,
avoiding, thus, redundant evaluations to locate the optimal conditions.
It enables an efficient parallel search based on intuitive sampling
strategies implicitly biasing toward exploration or exploitation of
the search space. Our benchmarks indicate that Phoenics is less sensitive
to the response surface than already established optimization algorithms.
We showcase the applicability of Phoenics on the Oregonator, a complex
case-study describing a nonlinear chemical reaction network. Despite
the large search space, Phoenics quickly identifies the conditions
which yield the desired target dynamic behavior. Overall, we recommend
Phoenics for rapid optimization of unknown expensive-to-evaluate objective
functions, such as experimentation or long-lasting computations.
Fundamental advances to increase the efficiency as well as stability of organic photovoltaics (OPVs) are achieved by designing ternary blends, which represents a clear trend toward multicomponent active layer blends. The development of high‐throughput and autonomous experimentation methods is reported for the effective optimization of multicomponent polymer blends for OPVs. A method for automated film formation enabling the fabrication of up to 6048 films per day is introduced. Equipping this automated experimentation platform with a Bayesian optimization, a self‐driving laboratory is constructed that autonomously evaluates measurements to design and execute the next experiments. To demonstrate the potential of these methods, a 4D parameter space of quaternary OPV blends is mapped and optimized for photostability. While with conventional approaches, roughly 100 mg of material would be necessary, the robot‐based platform can screen 2000 combinations with less than 10 mg, and machine‐learning‐enabled autonomous experimentation identifies stable compositions with less than 1 mg.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.