If simulations generate the training data for AI models, then the quality, reproducibility, and traceability of simulations becomes critical.
Three requirements follow:
This is the thread of today's talk: physics → software → data.
where
Characteristic:
Classical continuum mechanics
Peridynamics (PD)
Silling (2000): "Reformulation of elasticity theory for discontinuities and long-range forces."
Key insight: The horizon
| Horizon ratio |
Effect |
|---|---|
| Good compromise | |
| Better convergence | |
| Classical CM |
| Formulation | Key idea | Poisson's ratio | Anisotropy |
|---|---|---|---|
| Bond-based (BB) | Pairwise spring force | No | |
| Ordinary state-based (OSB) | Collective, central force | Arbitrary | No |
| Non-ordinary state-based (NOSB) | Deformation gradient |
Arbitrary | Yes |
| Correspondence (CCM) | Arbitrary | Yes |
Correspondence model: compute
Shape tensor (computed once):
Deformation gradient:
Linearised strain:
Stress (any classical model):
Force density state:
Equation of motion:






Challenge: Material properties in 3D printing are created during the process
PD approach:
Why PD for AM?
Residual stresses cause delamination and cracking — exactly the discontinuous phenomena where PD is strongest. No remeshing as layers are added.
Simulation workflow:

Common scenario:
This matters for AI:
A concrete example: Peridigm
Dependencies: CMake, MPI, Python, Fortran, C, C++, Trilinos (Blas, Lapack, X11), Boost, HDF5, NetCDF-C
Installation time: 1 day to 1 week — if it works.
A solution that does not run requires infinite time to solve a problem.
Technical criteria
Community criteria
FAIR for software: Findable · Accessible · Interoperable · Reusable — the same principles applied to research data now increasingly applied to research software.
The two-language problem:
Julia solves this:
Performance comparison
(2×2 matrix inversion, inner loop)
| Method | Time |
|---|---|
inv(Matrix) Julia |
~302 ns |
StaticArrays Julia |
~2 ns |
| Equivalent C++ | ~2 ns |
Julia achieves C-level performance with Python-level readability — without sacrificing either.
Goal: Reduce the entry barrier to peridynamic simulation
Key decisions:
| Criterion | Peridigm | PeriLab |
|---|---|---|
| Installation | Manual | Package manager |
| Build time | ~10 min | < 5 min |
| New material | 5+ files | 1 file |
| New parameter | 6 files + compile | YAML only |
| Libraries | Manual update | Automatic |
| Compilation knowledge | Deep | None |
# One file, one module — no recompilation of the core
module MyPlasticModel
using PeriLab
function compute_stress(strain, history, params)
# your constitutive law here
σ = params["E"] .* strain
return σ, history
end
end
# Reference in the YAML input deck — that is all
Material Models:
MyMaterial:
Material Model: MyPlasticModel
E: 210000
The macro system discovers and integrates the module at runtime. No changes to the core codebase, no recompilation, no deep knowledge of the solver internals required.
PeriHub is a web-based front end for PeriLab
Target users:
Access:
perilab-results.nimbus-extern.dlr.de
Examples available:
Provenance = the complete history of how a result was produced
Why it matters:
If a simulation result is used to train an AI model, and later found to be wrong — can you trace back which training data to discard?
Without provenance: no.
Provenance is not just good practice — in regulated industries (aerospace, medical devices) it is a legal requirement. In research, it is the foundation of reproducibility.
| Principle | Meaning | In practice |
|---|---|---|
| Findable | Persistent identifier | DOI for every dataset |
| Accessible | Open access protocol | Zenodo, DaRUS, … |
| Interoperable | Standard formats | HDF5, JSON, CSV |
| Reusable | Licence + provenance | CC-BY + metadata |
For simulation data this means:
A Zenodo record with DOI costs nothing and lasts forever. A GitHub repository without a release tag disappears when the project ends.
What PeriLab records automatically:
What PeriHub adds:
The chain:
G-code / geometry
↓
PeriLab input (YAML, versioned)
↓
Simulation run (version + hash recorded)
↓
Output (HDF5 + metadata)
↓
PeriHub (interactive, shareable)
↓
Zenodo / repository (DOI, FAIR)
↓
AI training dataset (traceable)
The current situation:
What we can do differently:
You — as the next generation of engineers and researchers — will decide:
Will the simulation data generated in the next 10 years be usable by the AI models of 2040?
Only if it is stored correctly today.
Peridynamics
Sustainable Software
Data Provenance
The message:
Simulation and AI are not in competition. Good simulation software with traceable data is what makes AI in engineering reliable.
Prof. Dr.-Ing. Christian Willberg
Magdeburg-Stendal University of Applied Sciences
christian.willberg@h2.de
Resources: