September 30, 2025
We present a Kokkos-accelerated implementation of the Moment Tensor Potential (MTP) for LAMMPS, designed to improve both computational performance and portability across CPUs and GPUs. This package introduces an optimized CPU variant—achieving up to 2× speedups over existing implementations—and two new GPU variants: a thread-parallel version for large-scale simulations and a block-parallel version optimized for smaller systems. It supports three core functionalities: standard inference, configuration-mode active learning, and neighborhood-mode active learning. Benchmarks and case studies demonstrate efficient scaling to million-atom systems, substantially extending accessible length and time scales while preserving the MTP’s near-quantum accuracy and native support for uncertainty quantification.
Moment Tensor Potential, GPU acceleration, Kokkos, Molecular Dynamics, Active learning, High-performance computing
Atomistic simulations have become a vital complement to experimental methods in materials discovery and characterization [1]–[3]. These simulations rely on interatomic potentials—models of atomic interactions—to compute energies and forces. Traditional potentials are computationally efficient but often lack the flexibility needed for high-fidelity predictions, while quantum mechanical methods such as density functional theory (DFT) offer greater accuracy at substantially higher computational cost. Machine learning interatomic potentials (MLIPs) have emerged as a compelling alternative, providing a systematic framework to improve accuracy by increasing model complexity, thereby enabling better control over the cost-accuracy trade-off than conventional approaches [4], [5]. However, the computational demands of high-capacity MLIPs remain a challenge, especially for large-scale or long-timescale simulations.
The rise of MLIPs has paralleled growing demand for computational power, making hardware accelerators—particularly GPUs—essential for large-scale simulations. At the same time, the landscape of high-performance computing (HPC) has become increasingly heterogeneous. Leading supercomputers such as LUMI, Frontier, and the newly commissioned El Capitan employ AMD Instinct GPUs (MI250X, MI300A) [6], while Aurora relies on Intel Max GPUs. However, fully leveraging these diverse architectures often requires adopting distinct programming models, vendor-specific frameworks, or low-level languages, posing a significant challenge for portability and maintainability.
LAMMPS—Large-scale Atomic/Molecular Massively Parallel Simulator—a widely used classical atomistic simulation engine [7], [8], supports Kokkos-based acceleration to enable performance portability across diverse hardware platforms. Kokkos provides a unified abstraction for parallel execution and data management [9], [10], allowing a single implementation to target CPUs, NVIDIA GPUs, AMD GPUs, and other architectures without sacrificing performance. Several interatomic potentials—such as the Tabulated Gaussian Approximation Potential (tabGAP) [11], [12], the Spectral Neighbor Analysis Potential (SNAP) [13], and the Atomic Cluster Expansion (ACE/PACE) [14], [15]—have been implemented in this framework, and have demonstrated scalability to billion-atom systems and nanosecond timescales [16]. These capabilities bring large-scale, high-fidelity simulations closer to experimentally relevant conditions in both space and time.
We extend the LAMMPS Kokkos package to support the Moment Tensor Potential (MTP) [17], including both inference and uncertainty quantification for active learning. MTP is one of the most widely used machine learning interatomic potentials, with demonstrated success across metals, semiconductors, and multicomponent systems [18]–[31]. It offers a strong balance of computational efficiency, systematic improvability, and built-in support for active learning via extrapolation grades based on D-optimality [32] and the MaxVol algorithm [33]. While newer formalisms such as ACE have outperformed MTP on recent Pareto fronts of accuracy versus cost [34], ongoing work has shown that careful adjustments to the basis construction can further improve its performance and expressiveness [25]. Integrating MTP into the Kokkos framework within LAMMPS enables scalable, portable deployment across GPU architectures, significantly broadening the range of accessible simulation sizes and timescales.
The software is available on GitHub [35].
We introduce nine new implementations of the Moment Tensor Potential, each optimized for different simulation conditions to provide flexibility across a wide range of system sizes and hardware configurations. These cover three core use cases: inference, active learning in configuration mode, and active learning in neighborhood mode. For each use case, we provide three implementations: (1) a further optimized CPU version (non-Kokkos) that improves upon the original MLIP-3 package [36]; (2) a thread-parallel GPU variant designed for large-scale simulations (typically \(\gtrsim\)50,000 atoms per GPU); and (3) a block-parallel GPU variant optimized for smaller simulations (typically \(\gtrsim\)2000 atoms per GPU). The block-parallel version, denoted with small in the LAMMPS pair_style, exposes additional fine-grained parallelism, which can improve performance at small-to-intermediate system sizes but may reduce peak throughput. Each variant follows LAMMPS pair_style naming conventions, as summarized in Table 1.
Table 1: LAMMPS pair_style names for each platform and use case.

| Platform | Inference | Active Learning (Both Modes) |
| --- | --- | --- |
| CPU | mtp | mtp/extrapolation |
| GPU, Thread-Parallel | mtp/kk | mtp/extrapolation/kk |
| GPU, Block-Parallel | mtp/small/kk | mtp/extrapolation/small/kk |
Users should first select the desired use case. Inference computes energies, forces, and stresses during standard molecular dynamics simulations. Active learning performs the same calculations while also evaluating the extrapolation grade at user-defined intervals, enabling on-the-fly model improvement. Configuration mode and neighborhood mode refer to two distinct strategies for computing this grade. As in MLIP-3, both modes support a selection threshold—above which configurations are written to disk—and a break threshold, above which the simulation is halted. Once the use case is chosen, the appropriate implementation (CPU, thread-parallel GPU, or block-parallel GPU) can be selected. For GPU usage, we recommend short single-GPU trial runs of both variants, as relative performance depends strongly on the MTP parameters and the underlying hardware.
The main contributions of the software are improved CPU performance and new GPU capabilities, while fully preserving the core functionality of the MTP as described in previous works [17], [36]. To evaluate these improvements, we benchmarked both weak and strong scaling on the Digital Research Alliance of Canada’s Narval HPC cluster. Narval’s CPU nodes are equipped with 2× AMD EPYC™ 7532 processors (32 cores each), and its GPU nodes with 4× NVIDIA A100 SXM4 (40 GB). MTP models are characterized by their “level”, which exponentially scales the number of basis functions—and thus the computational cost. Figure 1 (a) presents weak and strong scaling results across a range of MTP levels, using a quarter, half, and full Narval node (CPU and GPU). Except for the MTP level, all benchmarks use default hyperparameters (e.g., cutoff radius) for a bulk simulation of unstrained solid potassium. The complete LAMMPS input script and full benchmarking data are provided in a separate repository [37].
Figure 1: Inference speedups (relative maximum throughput) over the previous MLIP-3 implementation. (a) Log-log plot of the inference simulation rate versus atom count for several MTP implementations on various hardware at selected MTP levels. Separate simulations are performed for 100 timesteps for each atom count and method, and the best-of-five (Bo5) simulation rate is reported. A 1 fs timestep is used.
For each MTP level, we evaluated the maximum throughput—measured in atom-timesteps per wall-time second—across all tested atom counts and benchmark trials, and report relative speedups. Inference speedups compared to the original MLIP-3 implementation are shown in Figure 1, using a single CPU core and a single NVIDIA A100 GPU. Figure 2 presents corresponding speedups for active learning in both configuration and neighborhood modes, evaluated across systems containing one, two, or three atomic species.
Notably, the crossover point at which the thread-parallel variant outperforms the block-parallel variant depends strongly on the atom count, the MTP level, and the underlying hardware. Overall, the observed GPU speedups are comparable to the acceleration achieved by the existing Kokkos implementation of the ACE potential when comparing a single A100 GPU to a single CPU core [34].
As with other interatomic potentials in LAMMPS, MTP variants are specified using the pair_style command, followed by the path to the MTP potential file. These files are backward-compatible with the MLIP-3 format. For GPU variants, a chunk size must be provided using the chunksize keyword to manage memory usage. If the total number of atoms exceeds the specified chunk size, the simulation proceeds in multiple chunks. For optimal performance, the chunk size should be tuned to ensure sufficient parallelism while avoiding excessive memory usage (which can lead to contention) and minimizing the occurrence of a small final chunk (which can degrade performance due to underutilization).
pair_style mtp path/to/mtp/file
pair_style mtp/kk path/to/mtp/file chunksize 32768
pair_style mtp/small/kk path/to/mtp/file chunksize 32768
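As an illustrative example of the chunking arithmetic: with chunksize 32768, a 100,000-atom system is processed in four chunks per force evaluation (three full chunks of 32,768 atoms followed by a final chunk of only 1,696 atoms). Raising the chunk size to 33,334 or more would instead yield three nearly full chunks, avoiding the underutilized trailing chunk.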
The mode, either configuration or neighborhood, is read from the MTP file. For the neighborhood active-learning variants, we support both LAMMPS-like and MLIP-3-like processing of extrapolation grades. In the former style, the pair_style is invoked as in inference.
pair_style mtp/extrapolation path/to/mtp/file
pair_style mtp/extrapolation/kk path/to/mtp/file chunksize 32768
pair_style mtp/extrapolation/small/kk path/to/mtp/file chunksize 32768
A fix is then required to request extrapolation grades every X timesteps.
fix mtp_grade all pair X mtp/extrapolation extrapolation 1
fix mtp_grade all pair X mtp/extrapolation/kk extrapolation 1
fix mtp_grade all pair X mtp/extrapolation/small/kk extrapolation 1
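Here X is the evaluation interval in timesteps; the trailing extrapolation 1 follows the generic LAMMPS fix pair syntax, requesting that the pair style compute and store its per-atom extrapolation property on those timesteps.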
The neighborhood extrapolation grades can then be accessed through the f_mtp_grade variable. The LAMMPS dump command can then be used to periodically write the grades and other desired per-atom properties to a file. Notably, if the user attempts to access grades on timesteps where extrapolation is not being calculated, the values will not be up-to-date.
dump my_dump all custom X path/to/dump f_mtp_grade
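In practice, the grade is typically written alongside standard per-atom fields; a minimal sketch (the fields other than f_mtp_grade are illustrative):
dump my_dump all custom X path/to/dump id type x y z f_mtp_grade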
In the MLIP-3 style, the user specifies, in order, the MTP file, the output file, the selection threshold, and the break threshold. Extrapolation is evaluated every timestep; should the maximum grade surpass the selection threshold, the current configuration is written to the output file in the MLIP-3 format. Should this maximum grade surpass the break threshold, the simulation is immediately terminated. GPU variants still require the chunk size.
pair_style mtp/extrapolation path/to/mtp/file &
    path/to/output 2 10
pair_style mtp/extrapolation/kk path/to/mtp/file &
    path/to/output 2 10 chunksize 32768
pair_style mtp/extrapolation/small/kk path/to/mtp/file &
    path/to/output 2 10 chunksize 32768
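In the examples above, any configuration whose maximum extrapolation grade exceeds the selection threshold of 2 is written to path/to/output, and the simulation is terminated if the grade exceeds the break threshold of 10.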
Configuration mode is only available with the MLIP-3 style. In either mode, the maximum extrapolation grade at each timestep is made available through a LAMMPS compute.
compute max_grade all pair mtp/extrapolation
compute max_grade all pair mtp/extrapolation/kk
compute max_grade all pair mtp/extrapolation/small/kk
The variable can be accessed as usual through c_max_grade[1] and used in fix halt. Commonly, the user will print the grade along with other per-timestep quantities at regular intervals with LAMMPS thermo. Notably, if the user attempts to access this variable on timesteps where extrapolation is not being calculated, the value will not be up-to-date.
thermo_style custom step c_max_grade[1]
thermo X
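The same compute can also drive an automatic stop; a minimal sketch using the standard variable and fix halt commands (the fix ID, the threshold of 10, and the check interval X are illustrative):
variable grade equal c_max_grade[1]
fix stop_run all halt X v_grade > 10 error soft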
Much like some other MLIPs in LAMMPS, when invoking a LAMMPS script that uses an MTP Kokkos GPU variant from the command line, additional flags are required:
-pk kokkos newton on neigh half
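A typical single-GPU invocation might therefore look as follows (the executable and input-script names are illustrative; -k on g 1 enables Kokkos with one GPU, and -sf kk may be added to map unsuffixed styles to their Kokkos variants):
lmp -k on g 1 -pk kokkos newton on neigh half -in in.simulation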
A full example LAMMPS script and its command-line invocation are available in the supplementary materials [37].
We present three illustrative examples where GPU acceleration provides substantial benefits: (1) a large-scale simulation using a high-cost MTP, (2) a very large simulation with a medium-cost MTP, and (3) a medium-sized simulation that demonstrates active learning in practice. The LAMMPS input scripts for all examples are included in the supplementary materials [37]. Visualizations were produced using OVITO [38].
Dislocations are crystallographic line defects that disrupt the regular atomic structure of a material and often arise under mechanical stress or elevated temperatures. Once nucleated, they can propagate and multiply, affecting plasticity and deformation mechanisms. In semiconductors—critical to technologies such as transistors, LEDs, and solar cells—dislocations can influence fabrication processes, carrier transport, and overall device performance. Accurate simulation of dislocation behavior requires large simulation cells to capture long-range elastic fields and collective dislocation dynamics. As a case study, we generated a silicon cell containing approximately 115,000 atoms with a screw dislocation characterized by a C1-type core using Atomsk [39]. The structure was relaxed to the more stable C2 configuration, which is energetically favored in diamond cubic crystals. Molecular dynamics simulations were then performed using the isothermal–isobaric (NPT) ensemble and a level-26 silicon–oxygen MTP developed by Zongo et al. [20]. A shear strain rate of \(5\times10^7\) s\(^{-1}\) was applied over a 1 ns simulation at 10 K to evaluate dislocation mobility. Results are shown in Figure 3.
Figure 3: Shearing of a C2-core screw dislocation in silicon (115 thousand atoms, \(5\times10^7\) s\(^{-1}\) strain rate, 1 ns, 1 fs timestep). Top-left: unstrained. Bottom-left: strained. Right: engineering shear stress-strain curve.
Using the block-parallel inference variant on a full Narval GPU node, we achieved a simulation rate of 0.515 ns/day. In contrast, the original MLIP-3 implementation running on a full Narval CPU node achieved only 0.032 ns/day in a short trial run—a rate that is impractically slow for this type of simulation.
Nanocrystalline tension simulations are widely used to investigate defect dynamics, grain boundary behavior, and mechanical properties in materials with nanometer-scale grain sizes. To avoid artificial periodicity effects—where grains interact with their own periodic images—simulations must include a sufficiently large number of grains, often requiring millions of atoms for realistic structures. As a representative case, we generated an aluminum polycrystal using Atomsk [39] and employed a level-16 MTP from Novikov et al. [19]. The system contained one million atoms with a mean grain size of 11.1 nm. We performed uniaxial tensile deformation at 300 K to a strain of 0.1 over 1 ns, corresponding to a strain rate of \(10^8\) s\(^{-1}\). Results are presented in Figure 4. Note that this potential was selected for demonstration purposes and was not specifically validated for this material or deformation mode.
Figure 4: Nanocrystalline tension of aluminum (1.00 million atoms, \(10^8\) s\(^{-1}\) strain rate, 1 ns, 1 fs timestep). Left: unstrained. Center: strained (0.1 strain). Right: engineering stress-strain curve.
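The deformation protocol described above corresponds to a standard LAMMPS setup; a minimal sketch assuming metal units (time in ps), with illustrative fix IDs and damping constants (the published input script [37] may differ):
# uniaxial tension along x at 1e8 /s = 1e-4 /ps; lateral dimensions barostatted at zero pressure
fix relax all npt temp 300.0 300.0 0.1 y 0.0 0.0 1.0 z 0.0 0.0 1.0
fix pull all deform 1 x erate 1.0e-4 remap x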
Using the thread-parallel inference variant on a full Narval GPU node, we achieved a simulation rate of 0.453 ns/day. In comparison, the original MLIP-3 implementation on a full Narval CPU node yielded only 0.026 ns/day in a short trial run—a prohibitively slow rate for a simulation of this scale.
After training an MTP on quantum-mechanical data, uncertainty quantification can be employed during early production MD simulations to assess the model’s reliability. In one such case, we developed a sodium–potassium alloy potential and sought to determine the eutectic melting point using a 3600-atom coexistence simulation featuring a solid–liquid interface (BCC + C14 + liquid), shown in Figure 5. To ensure the level-18 MTP remained reliable when applied to potentially out-of-distribution configurations, we enabled active learning. This approach reduced the need to construct an excessively large validation set using quantum methods, while still assessing regions of potentially high extrapolation.
We performed this active learning simulation in configuration mode using both MLIP-3 on a full Narval CPU node and our block-parallel GPU implementation on a single A100. The MLIP-3 implementation achieved a simulation rate of 0.535 ns/day, while the GPU version reached 14.925 ns/day. Notably, despite the system size being well within the range typically amenable to MPI parallelization, MLIP-3’s configuration mode exhibited limited scalability. This inefficiency would likely worsen if additional CPU nodes were used in an attempt to improve the simulation rate.
When exploring, discovering, and characterizing materials through atomistic simulations, many phenomena of interest require simulations involving millions of atoms. Examples include amorphous materials [40], crack propagation [41], nanocrystalline systems [42], irradiation damage [43], and dislocation dynamics and plasticity [44].
Despite its popularity, demonstrated successes, native support for active learning, and strong computational efficiency, the MTP has lacked GPU support—limiting its scalability on modern HPC systems. By introducing a family of GPU-accelerated variants, we significantly expand MTP’s applicability to larger, more complex simulations. In particular, the block-parallel implementation achieves peak throughput with approximately 2000 atoms per GPU, enabling faster time-to-solution for many existing problems and making it feasible to deploy higher-level, more accurate MTPs that would otherwise be computationally prohibitive. These GPU implementations also enable medium-scale simulations on consumer-grade hardware or with Multi-Instance GPU (MIG), improving accessibility across a broader range of research environments. Additionally, our optimized CPU variant consistently outperforms the original implementation, with speedups of up to 2\(\times\).
This software contribution is part of a broader effort to improve the MTP and related potentials, such as the Equivariant Tensor Network Potential, which could likewise be extended with Kokkos support.
We would like to acknowledge Alexander Shapeev and Ivan Novikov for their insight and discussions regarding the MTP implementation and future work.
This work was funded by the University Network for Excellence in Nuclear Engineering (UNENE), the Natural Sciences and Engineering Research Council of Canada (NSERC), and Mitacs. We thank the Digital Research Alliance of Canada (DRAC) for the generous allocation of computer resources. This work was partly funded by Atomic Energy of Canada Limited, Canada, under the auspices of the Federal Nuclear Science and Technology Program.
Zijian Meng: Conceptualization, Methodology, Software, Validation, Visualization, Investigation, Data Curation, Writing - Original Draft, Writing - Review & Editing. Karim Zongo: Validation, Writing - Review & Editing. Edmanuel Torres: Writing - Review & Editing. Christopher Maxwell: Writing - Review & Editing. Ryan Grant: Methodology, Resources, Writing - Review & Editing, Supervision, Funding acquisition. Laurent Karim Béland: Methodology, Resources, Writing - Review & Editing, Supervision, Funding acquisition, Project administration.
During the preparation of this work, the authors used Gemini to help write Python plotting scripts for the benchmark data. After using this tool, the authors reviewed and edited the plots as needed and take full responsibility for the content of the published article.