Automatic Mesh Generation by Machine Learning

Category: Analysis | Integrated 2026-04-06
Auto-meshing theory in CAE: mesh sizing field h(x,y) showing spatial variation of target element size from fine near boundaries to coarse in the interior
Theory & Physics

Automatic Mesh Generation by Machine Learning: Theoretical Foundations

Overview

๐Ÿง‘โ€๐ŸŽ“

Teacher! Today's topic is about automatic mesh generation using machine learning, right? What is it like?


๐ŸŽ“

It's an ML method that predicts optimal mesh parameters (element size, division count, etc.) from geometric shape features. It automates mesh generation that previously relied on empirical rules, reducing the burden on analysts.



Governing Equations


๐ŸŽ“

Expressing this with an equation, it looks like this.


$$\mathbf{h}^*(\mathbf{x}) = f_\theta(\text{GeomFeatures}(\mathbf{x}))$$

๐Ÿง‘โ€๐ŸŽ“

Hmm, just the equation doesn't really click... What does it represent?



$$\min_\theta \sum_{i}\|h_i^{pred} - h_i^{opt}\|^2 + \lambda \sum_{e}Q_e(h)$$

Theoretical Foundation

๐Ÿง‘โ€๐ŸŽ“

I've heard of "theoretical foundation," but I might not fully understand it...


๐ŸŽ“

Automatic mesh generation using machine learning is an important method aiming to fuse data-driven approaches and physics-based modeling. In conventional CAE analysis, computational cost is a major bottleneck, but by introducing machine learning-based automatic mesh generation, the trade-off between computational efficiency and prediction accuracy can be significantly improved. The mathematical foundation of this method is based on function approximation theory and statistical learning theory, with theoretical research challenges including guarantees of generalization performance and rigorous analysis of convergence. Particularly, dealing with the "curse of dimensionality" when the input dimension is high is a key practical issue, and approaches like dimensionality reduction and leveraging sparsity are important.


๐Ÿง‘โ€๐ŸŽ“

After hearing this, I finally understand why automatic mesh generation using machine learning is so important!


Details of Mathematical Formulation

๐Ÿง‘โ€๐ŸŽ“

Next is "Details of Mathematical Formulation"! What kind of content is this?


๐ŸŽ“

It shows the basic mathematical framework for applying machine learning models to CAE.



Loss Function Composition

๐Ÿง‘โ€๐ŸŽ“

What does "loss function composition" specifically mean?


๐ŸŽ“

The loss function in AIร—CAE is composed as a weighted sum of a data-driven term and a physics constraint term:



$$ \mathcal{L} = \lambda_d \mathcal{L}_{\text{data}} + \lambda_p \mathcal{L}_{\text{physics}} + \lambda_r \mathcal{L}_{\text{reg}} $$


๐ŸŽ“

Here, $\mathcal{L}_{\text{data}}$ is the squared error with observed data, $\mathcal{L}_{\text{physics}}$ is the residual of the governing equations, and $\mathcal{L}_{\text{reg}}$ is the regularization term. Adjusting the weight parameters $\lambda$ greatly affects learning stability and accuracy.




Generalization Performance and Extrapolation Problem

๐Ÿง‘โ€๐ŸŽ“

Please teach me about "Generalization Performance and the Extrapolation Problem"!


๐ŸŽ“

The biggest challenge for surrogate models is prediction accuracy outside the range of training data (extrapolation region). Incorporating physical laws can improve extrapolation performance, but complete guarantees are difficult.




Curse of Dimensionality

๐Ÿง‘โ€๐ŸŽ“

Please teach me about the "Curse of Dimensionality"!


๐ŸŽ“

When the dimension of the input parameter space is high, the required number of samples increases exponentially. Efficient sample placement using Active Learning or Latin Hypercube Sampling (LHS) is extremely important.



$$ N_{\text{samples}} \propto d^{\alpha}, \quad \alpha \geq 1 $$

Assumptions and Applicability Limits

๐Ÿง‘โ€๐ŸŽ“

Is this equation not universal? When can't it be used?


๐ŸŽ“
  • The training data sufficiently represents the physics of the analysis target.
  • The relationship between input parameters and output is smooth (if discontinuities exist, domain partitioning is necessary).
  • Reducing computational cost is the main purpose; conventional solvers should be used in conjunction for final verification requiring high accuracy.
  • If the quality of training data (mesh-converged, V&V completed) is insufficient, model reliability decreases.

๐Ÿง‘โ€๐ŸŽ“

Ah, I see! So that's how the mechanism of training data being the analysis target works.


Dimensionless Parameters and Dominant Scales

๐Ÿง‘โ€๐ŸŽ“

Teacher, please teach me about "Dimensionless Parameters and Dominant Scales"!


๐ŸŽ“

Understanding the dimensionless parameters governing the physical phenomenon being analyzed forms the basis for appropriate model selection and parameter setting.


๐ŸŽ“
  • Pรฉclet Number Pe: Relative importance of convection and diffusion. For Pe >> 1, convection dominates (stabilization methods are needed).
  • Reynolds Number Re: Ratio of inertial forces to viscous forces. A fundamental parameter for fluid problems.
  • Biot Number Bi: Ratio of internal conduction to surface convection. For Bi < 0.1, the lumped capacitance method is applicable.
  • Courant Number CFL: Indicator of numerical stability. For explicit methods, CFL โ‰ค 1 is required.

๐Ÿง‘โ€๐ŸŽ“

Ah, I see! So that's how the mechanism of the analysis target's physical phenomenon works.



Verification via Dimensional Analysis

๐Ÿง‘โ€๐ŸŽ“

Please teach me about "Verification via Dimensional Analysis"!


๐ŸŽ“

For order-of-magnitude estimation of analysis results, dimensional analysis based on Buckingham's ฮ  theorem is effective. Using characteristic length $L$, characteristic velocity $U$, and characteristic time $T = L/U$, the order of each physical quantity is estimated in advance to confirm the validity of the analysis results.


๐Ÿง‘โ€๐ŸŽ“

I see. So if the analysis target's physical phenomenon is understood, then it's basically okay?


Classification of Boundary Conditions and Mathematical Characteristics

๐Ÿง‘โ€๐ŸŽ“

I've heard that if you get the boundary conditions wrong, everything fails...


TypeMathematical ExpressionPhysical MeaningExample
Dirichlet Condition$u = u_0$ on $\Gamma_D$Specification of variable valueFixed wall, specified temperature
Neumann Condition$\partial u/\partial n = g$ on $\Gamma_N$Specification of gradient (flux)Heat flux, force
Robin Condition$\alpha u + \beta \partial u/\partial n = h$Linear combination of variable and gradientConvective heat transfer
Periodic Boundary Condition$u(x) = u(x+L)$Spatial periodicityUnit cell analysis
๐ŸŽ“

Choosing appropriate boundary conditions directly affects solution uniqueness and physical validity. Insufficient boundary conditions lead to an ill-posed problem, while excessive boundary conditions cause contradictions.



๐Ÿง‘โ€๐ŸŽ“

Wow, automatic mesh generation using machine learning is really deep... But thanks to your explanation, I've managed to organize my thoughts a lot!


๐ŸŽ“

Yeah, you're doing great! Actually getting hands-on is the best way to learn. If you don't understand something, feel free to ask anytime.


Coffee Break Yomoyama Talk

History of Automatic Mesh Generationโ€”From Delaunay Triangulation to Neural Networks

The theory of automatic mesh generation began with Delaunay triangulation in the 1960s. Using the geometric property that "no four points lie on the same circle," algorithms that generate optimal triangular meshes from point clouds still form the basis of 2D automatic meshing today. The Delaunay-Voronoi method extended to 3D and the Advancing Front method, which gradually stacks elements from the boundary, were mainstream from the 1990s to 2000s. Machine learning entered the scene around 2018. Methods emerged that analyze CAD shape B-Rep data with CNNs or GNNs to learn "this area should be dense" type of local judgments from data. The theoretical interest lies in how to fuse traditional geometric algorithms with learning-based "empirical judgment."

Computational Methods for Automatic Mesh Generation by Machine Learning

Details of Numerical Methods

๐Ÿง‘โ€๐ŸŽ“

Specifically, what algorithms are used to solve automatic mesh generation using machine learning?


๐ŸŽ“

Explains the numerical methods and algorithms for implementing automatic mesh generation using machine learning.


๐Ÿง‘โ€๐ŸŽ“

I see. So if automatic mesh generation is understood, then it's basically okay?


Discretization and Calculation Procedure

๐Ÿง‘โ€๐ŸŽ“

How do you actually solve this equation on a computer?


๐ŸŽ“

As data preprocessing, normalization/standardization of input features is important. Since CAE data have vastly different scales for each physical quantity, appropriate selection of Min-Max normalization or Z-score normalization is necessary. For learning algorithm selection, appropriate methods should be chosen according to data volume, dimensionality, and degree of nonlinearity.



Implementation Considerations

๐Ÿง‘โ€๐ŸŽ“

When using automatic mesh generation with machine learning in practice, what is the most important thing to be careful about?


๐ŸŽ“

Implementation using the Python ecosystem (scikit-learn, PyTorch, TensorFlow) is common. Keys to implementation are learning acceleration via GPU parallelization, automatic hyperparameter tuning, and preventing overfitting via cross-validation. For efficient I/O processing of large-scale CAE data, using the HDF5 format is recommended.



Verification Methods

๐Ÿง‘โ€๐ŸŽ“

Teacher, please teach me about "Verification Methods"!


๐ŸŽ“

It's important to use k-fold cross-validation, Leave-One-Out method, and holdout method appropriately for the purpose, and to evaluate prediction performance comprehensively using coefficient of determination Rยฒ, RMSE, MAE, and maximum error.


๐Ÿง‘โ€๐ŸŽ“

Now I understand what my senior meant when they said, "At least do cross-validation properly."


Code Quality and Reproducibility

๐Ÿง‘โ€๐ŸŽ“

When using automatic mesh generation with machine learning in practice, what is the most important thing to be careful about?


๐ŸŽ“

Ensure code quality and experiment reproducibility by introducing version control (Git), automated testing (pytest), and CI/CD pipelines. Strictly enforce dependency library version pinning (requirements.txt) to make rebuilding the computational environment easy. Ensuring result reproducibility by fixing random seeds is also an important implementation practice.