Deep Learning — CAE Glossary
Deep Learning Overview
Definition
Deep Learning is a type of machine learning method using multi-layer neural networks. In CAE, it is widely applied to building surrogate models that rapidly approximate computationally expensive numerical simulations (FEM, CFD, etc.), PINN which incorporates physical laws into network learning, transfer learning for adapting to new problems with minimal data, and CNN-based methods for predicting physical fields as images.
The basic structure of a neural network applies a series of nonlinear transformations to input $\mathbf{x}$ to obtain output $\hat{\mathbf{y}}$:
$$\hat{\mathbf{y}} = f_L \circ f_{L-1} \circ \cdots \circ f_1(\mathbf{x}), \quad f_l(\mathbf{z}) = \sigma(\mathbf{W}_l \mathbf{z} + \mathbf{b}_l)$$where $\mathbf{W}_l$ is the weight matrix, $\mathbf{b}_l$ is the bias vector, and $\sigma$ is an activation function (ReLU, GELU, tanh, etc.).
Surrogate Models
How is deep learning used in the CAE world? Does it replace FEM or CFD?
It doesn't completely replace them, but its use as a surrogate model (proxy model) is growing. For example, in automotive crash simulation, a single FEM calculation takes hours. But if you pre-train a neural network on hundreds or thousands of simulation results, you can predict stress distributions for new design parameters in milliseconds.
We hear about 1000x speedup, but is the accuracy reliable? Can it be used for design decisions?
Good observation. Surrogate model accuracy depends heavily on training data quantity and quality. In practice, the mainstream approach is: use surrogate models for design exploration and screening, then validate final candidates with traditional FEM/CFD. For example, screen 10,000 design candidates with a surrogate model, then run full analysis on only the top 100.
In surrogate model training, we have a dataset of input parameters $\boldsymbol{\mu} = (\mu_1, \mu_2, \ldots, \mu_d)$ (material properties, geometric parameters, boundary conditions, etc.) paired with corresponding physical quantities $\mathbf{u}(\boldsymbol{\mu})$, and minimize the loss function:
$$\mathcal{L}_{\text{data}} = \frac{1}{N}\sum_{i=1}^{N} \left\| \hat{\mathbf{u}}(\boldsymbol{\mu}_i; \boldsymbol{\theta}) - \mathbf{u}_i \right\|^2$$where $\boldsymbol{\theta}$ represents all network parameters (weights and biases).
PINN (Physics-Informed Neural Networks)
PINN has been appearing frequently lately. How does it differ from ordinary neural networks?
PINN stands for Physics-Informed Neural Network. The key feature is incorporating the residual of governing equations into the loss function. Ordinary DL learns only from data, but PINN directly teaches the network that "this solution must satisfy the Navier-Stokes equation" or other physical constraints.
Specifically, how do you incorporate physical laws? I can't quite visualize writing equations into a loss function...
Let's consider steady-state heat conduction. The governing equation is $\nabla^2 T = 0$. In PINN, you compute the second partial derivatives of network output $\hat{T}(x,y)$ using automatic differentiation, then add a penalty term to the loss function to make $\nabla^2 \hat{T}$ approach zero. Physical laws act as constraints even in data-sparse regions, so non-physical solutions are less likely. A common application is inverse problems—for example, inferring material thermal conductivity from sensor data at a few points.
The total PINN loss function typically takes the form:
$$\mathcal{L}_{\text{PINN}} = \underbrace{\lambda_{\text{data}} \mathcal{L}_{\text{data}}}_{\text{consistency with observations}} + \underbrace{\lambda_{\text{PDE}} \mathcal{L}_{\text{PDE}}}_{\text{PDE residual}} + \underbrace{\lambda_{\text{BC}} \mathcal{L}_{\text{BC}}}_{\text{boundary condition residual}}$$The PDE residual term is computed using automatic differentiation:
$$\mathcal{L}_{\text{PDE}} = \frac{1}{N_r}\sum_{i=1}^{N_r}\left| \mathcal{N}[\hat{u}](\mathbf{x}_i) \right|^2$$where $\mathcal{N}[\cdot]$ is a differential operator (e.g., $\nabla^2 u - f$ or $\rho(\mathbf{u}\cdot\nabla)\mathbf{u} + \nabla p - \mu\nabla^2\mathbf{u}$).
Transfer Learning
How is transfer learning used in CAE? I only think of image recognition applications.
The principle is the same as in image recognition. You reuse the weights from a pre-trained model on a similar physical phenomenon as initial values. For instance, if you have a model trained on 5000 cases of NACA 0012 airfoil aerodynamics, transferring it to NACA 4412 achieves high accuracy with only 200–300 additional data points. From scratch, you'd need about 3000 cases.
So it reduces the cost of generating simulation data itself! That's quite practical.
Exactly. In the CAE world, generating a single data point takes minutes to hours, so reducing training data generation costs has enormous impact. Transfer learning can reduce that cost by an order of magnitude. However, if the source and target tasks are too dissimilar, you risk negative transfer, so careful assessment of physical similarity is essential.
CNN-based Field Prediction
I heard CNN can directly predict stress distributions and temperature fields. But meshes aren't images, so how does the input work?
Great question. There are several approaches, but the most intuitive is SDF (Signed Distance Field) representation. You convert the analysis domain geometry to a grid image where each pixel stores the distance to the object boundary. This becomes an input channel for CNN, and the output channel predicts the stress or temperature contour.
So you represent shape as an image and physical quantities as output images. What network architectures are used?
U-Net and Encoder-Decoder architectures are standard. The structure is the same as in image segmentation, but input replaces photos with SDF or boundary condition maps, and output replaces segmentation masks with stress or velocity contours. Skip connections help preserve spatial fine structure. Inference completes in milliseconds, enabling real-time design interfaces.
Can it handle 3D problems? Images seem inherently 2D.
3D CNN can handle it, but memory grows cubically, making resolutions around $128^3$ the practical limit. Recently, Graph Neural Network (GNN) approaches are emerging, handling unstructured meshes directly as graphs with greater mesh freedom. Point cloud-based techniques using PointNet are also advancing.
Practical Considerations
What should practitioners be careful about when deploying deep learning in CAE?
The most important is caution regarding extrapolation. Neural networks are accurate within the training data range (interpolation) but predictions break down for unseen conditions—unfamiliar material properties or extreme loads. Physics-based simulators have conservation laws built in and can handle extrapolation reasonably well, but pure data-driven models lack that guarantee. Either incorporate physics like PINN, or explicitly evaluate reliability in extrapolation regions through V&V.
So is deep learning the CAE field's "ally" or "rival"?
Definitely an ally. Traditional FEM/CFD and deep learning are complementary. FEM/CFD is physically rigorous but slow; deep learning is fast but has limited generalization. The hybrid approachdigital twin and real-time control contexts.
Related Terms
- Surrogate Model: A low-cost model approximating expensive simulations. Beyond deep learning, includes kriging and RBF.
- PINN (Physics-Informed Neural Network): Neural network incorporating PDE residuals in the loss function.
- Transfer Learning: Technique reusing knowledge from one task to a similar different task.
- CNN (Convolutional Neural Network): Neural network using convolutional layers, excellent for image-like spatial data.
- Automatic Differentiation: Core PINN technology that accurately computes partial derivatives along computational graphs.
- GNN (Graph Neural Network): Network handling unstructured meshes directly as graphs.
Precise understanding of CAE terminology is the foundation of team communication. — Project NovaSolver aims to support practitioners' learning.
Share your challenges with Deep Learning × CAE
Project NovaSolver addresses challenges CAE engineers face daily—setup complexity, computational cost, result interpretation. Your real-world experience drives better tool development.
Contact Us (coming soon)Related Topics
details
error