Inferring the reaction pathways underlying the processing of complex feeds, using noisy data from spectral sensors that may contain information regarding molecular mechanisms, is challenging. This is tackled by a...
We use self-modeling multivariate curve resolution to identify pseudo-components and chemical transformations in thermal conversion of Athabasca bitumen.
Extracting meaningful information from spectroscopic data is key to species identification as a first step to monitoring chemical reactions in unknown complex mixtures. Spectroscopic data obtained over multiple process modes (temperature, residence time) from different sensors [Fourier transform infrared (FTIR), proton nuclear magnetic resonance ( 1 H NMR)] comprise hidden complementary information of the underlying chemical system. This work proposes an approach to jointly capture these hidden patterns in a structure-preserving and interpretable manner using coupled non-negative tensor factorization to achieve uniqueness in decomposition. Projections onto the modes of spectral channels, specific to each sensor, are interpreted as pseudo-component spectra, while projections onto the shared process modes are interpreted as the corresponding pseudo-component concentrations across temperature and residence times. Causal structure inference among these pseudo-component spectra (using Bayesian networks) is then used to identify plausible reaction pathways among the identified species representing each pseudo-component. Tensor decomposition of the FTIR data enables the development of reaction sequences based on the identified functional groups, while that of 1 H NMR by itself is lacking in mechanism development as it solely reveals the proton environments in a pseudocomponent. However, jointly parsing spectra from both the sensors is seen to capture complementary information, wherein insights into the proton environment from 1 H NMR disambiguate pseudo-components that have similar FTIR peaks. A scalable method of parallelizing tensor decomposition to handle high-dimensional modes in process data by using grid tensor factorization, while being robust to process data artifacts like outliers, noise, and missing data, has been developed.
Process systems engineering (PSE), as the name suggests, emphasizes an approach to understanding the behavior of systems as a whole with a view to improving decision-making for optimization and control of processes. The discipline emphasizes the application of mathematical techniques in this effort, and a plausible claim has been made that is at the very core of the discipline of chemical engineering. Being a generalized approach to process systems in general, it finds wide application to many areas in chemical engineering. This work reviews the application of PSE to the area of reaction engineering, which is also at the core of chemical engineering. We highlight the impactful applications of PSE in reaction engineering and discuss aspects of model building and analysis, reactor control, optimization, chemometrics, and chemoinformatics.
In this work, we present and validate a methodology for generating reaction networks from spectroscopic data using data-driven methods by applying it to the hydrothermal liquefaction (HTL) of Monterrey pine biomass and its constituents, viz., cellulose and lignin. This work is presented as a step toward automated inference of chemistry of the hydrothermal liquefaction process, thus limiting the need for human expertise. Bayesian hierarchical clustering of spectra and selfmodeling multivariate spectral curve resolution are used to generate groups of chemically similar species, the reaction networks among which have been developed using Bayesian networks. Fourier transform infrared spectroscopy and proton nuclear magnetic resonance spectroscopy-based measurements are used as input data. The data-driven reaction network includes pathways representing decomposition of the biomass components, large molecule hydrolysis, and reformation of produced molecules and is consistent with the literature. Furthermore, the comparison of the networks generated for biomass and its components (levoglucosan, representing cellulose, and 2-phenoxy-ethyl benzene, representing lignin) reveals the relationship between the biomass HTL reaction network and the reaction networks of the components. The data-driven approach provides a diagnostic tool to identify the most probable reaction chemistry for complex biomass feedstocks and can be used for process understanding, design, and control.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.