2022
DOI: 10.48550/arxiv.2201.01032
Preprint

Learning Operators with Coupled Attention

Abstract: Supervised operator learning is an emerging machine learning paradigm with applications to modeling the evolution of spatio-temporal dynamical systems and approximating general black-box relationships between functional data. We propose a novel operator learning method, LOCA (Learning Operators with Coupled Attention), motivated by the recent success of the attention mechanism. In our architecture, the input functions are mapped to a finite set of features which are then averaged with attention weights that …
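As a rough illustration of the mechanism sketched in the abstract, the following is a minimal JAX sketch of attention-weighted feature averaging; the function and variable names are hypothetical assumptions for illustration, not the authors' implementation.

```python
import jax.numpy as jnp
from jax import nn

# Minimal sketch of attention-weighted feature averaging, assuming the input
# function has already been encoded into a finite set of features and a score
# network has produced per-query logits. All names here are hypothetical.

def attention_averaged_output(u_features, query_scores):
    # u_features: (n, d_out) features extracted from the input function u
    # query_scores: (m, n) raw scores for m output query locations
    weights = nn.softmax(query_scores, axis=-1)  # (m, n) attention weights per query
    return weights @ u_features                  # (m, d_out) predicted output values
```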

Cited by 5 publications (4 citation statements)
References 38 publications (60 reference statements)
“…When comparing the prediction accuracy from different models, similar to the previous examples, the FNO suffers from overfitting and vanishing gradient issues when L > 2, especially in the original (more noisy) dataset. This finding is consistent with the results reported in [68,103], where the performance of the FNOs was found to deteriorate on noisy datasets. In contrast, the accuracy of the IFNOs monotonically improves with the increase of L. Both neural operator models outperform the conventional constitutive modeling approaches by around one order of magnitude.…”
Section: Results (supporting)
confidence: 92%
“…In the regularized cavity flow problem presented in [17], an additional trigonometric input function was included in the branch network to incorporate periodic boundary conditions. Similarly, an additional input function is included in [20]: it is created by averaging a feature embedded in the inputs of the branch network over probability distributions that depend on the corresponding query locations of the output function. This employs the kernel-coupled attention mechanism, which allows the operator to accurately model correlations between the query locations of the output functions.…”
Section: Feature Expansion in DeepONet (mentioning)
confidence: 99%
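The kernel coupling described in the statement above can be illustrated with a short, hedged sketch: attention weights computed independently at each query location are mixed through a kernel over the query locations and then renormalized. The kernel choice, normalization, and all names below are assumptions for illustration, not the LOCA source code.

```python
import jax.numpy as jnp
from jax import nn

def rbf_kernel(ys, lengthscale=1.0):
    # ys: (m, d_y) query locations -> (m, m) similarity matrix (hypothetical kernel choice)
    sq_dists = jnp.sum((ys[:, None, :] - ys[None, :, :]) ** 2, axis=-1)
    return jnp.exp(-0.5 * sq_dists / lengthscale ** 2)

def kernel_coupled_weights(query_scores, ys, lengthscale=1.0):
    # query_scores: (m, n) per-query logits; ys: (m, d_y) query locations
    probs = nn.softmax(query_scores, axis=-1)        # (m, n) independent per-query weights
    coupled = rbf_kernel(ys, lengthscale) @ probs    # mix weights across nearby queries
    return coupled / jnp.sum(coupled, axis=-1, keepdims=True)  # renormalize each row
```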
“…Guo et al. [29] introduce attention as an instance-based learnable kernel for the direct sampling method and demonstrate its superiority on boundary value inverse problems. Learning operators with coupled attention [32] uses attention weights to learn correlations in the output domain and enables sample-efficient training of the model. General neural operator transformer for operator learning [25] proposes a heterogeneous attention architecture that stacks multiple cross-attention layers and uses a geometric gating mechanism to adaptively aggregate features from query points.…”
Section: Introduction (mentioning)
confidence: 99%