v3.0.0a0 #3401
- Great work! 🎉
- Are there any examples of fine-tuning a DPA-2 pretrained model?
- Nice work! In the DPA-2 descriptor, the outputs of the …
- When discussing representations of the rotation group, specifically SO(3), it is necessary to introduce the concept of a representation space. In the majority of "equivariant" models, this space is spanned by spherical harmonics. To preserve as much information about the atomic environment as possible, one must increase the completeness of the basis (denoted l in the context of spherical harmonics), but this comes at the cost of increased computational expense. The DP models take a different approach: instead of spanning the atomic environment in a representation space, they utilize the original (equivariant) information without any approximation. This may be why DPA-2 exhibits higher generalizability in our benchmarks, as evidenced by Supplementary S2 of https://arxiv.org/abs/2312.15492.
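For context, a minimal sketch of the truncation this comment refers to: an equivariant model expands the angular content of atom i's neighborhood in spherical harmonics up to a chosen cutoff $l_{\max}$, and whatever lies above $l_{\max}$ is discarded:

$$\rho_i(\hat{\mathbf{r}}) \;\approx\; \sum_{l=0}^{l_{\max}} \sum_{m=-l}^{l} c^{(i)}_{lm}\, Y_{lm}(\hat{\mathbf{r}})$$

Raising $l_{\max}$ recovers more angular detail, at a computational cost that grows rapidly with $l_{\max}$.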
- Hello. Thank you for developing the new version of DeePMD-kit. When will the stable version of this release be available? Will it be possible to install it via conda? Thanks a lot.
-
DeePMD-kit v3: A multiple-backend framework for deep potentials
We are excited to announce the first alpha version of DeePMD-kit v3. DeePMD-kit v3 allows you to train and run deep potential models on top of TensorFlow or PyTorch. DeePMD-kit v3 also supports the DPA-2 model, a novel architecture for large atomic models.
Highlights
Multiple-backend framework
DeePMD-kit v3 adds a pluggable multiple-backend framework to provide consistent training and inference experiences between different backends. You can:
- run inference with any model through the existing interfaces, including `dp test`, the Python/C++/C interfaces, and third-party packages (dpdata, ASE, LAMMPS, AMBER, GROMACS, i-PI, CP2K, OpenMM, ABACUS, etc.); take LAMMPS as an example (see the sketch after this list);
- convert a model between backends with `dp convert-backend`, if both backends support that model (see the second sketch below).
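A minimal sketch of the LAMMPS usage, assuming hypothetical frozen models named `frozen_model.pb` (TensorFlow) and `frozen_model.pth` (PyTorch); the backend is selected by the model file extension:

```lammps
# the same pair_style works for a model from either backend;
# the file extension picks the backend (.pb: TensorFlow, .pth: PyTorch)
pair_style   deepmd frozen_model.pth
pair_coeff   * *
```

And a sketch of converting a model between backends (same hypothetical file names):

```sh
# TensorFlow -> PyTorch, and back again
dp convert-backend frozen_model.pb frozen_model.pth
dp convert-backend frozen_model.pth frozen_model.pb
```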
PyTorch backend: a backend designed for large atomic models and new research
We added the PyTorch backend in DeePMD-kit v3 to support the development of new models, especially for large atomic models.
DPA-2 model: Towards a universal large atomic model for molecular and material simulation
The DPA-2 model is a novel architecture for a Large Atomic Model (LAM): it can accurately represent a diverse range of chemical systems and materials, enabling high-quality simulations and predictions with significantly reduced effort compared to traditional methods. The DPA-2 model is implemented only in the PyTorch backend. An example configuration is in the `examples/water/dpa2` directory.
The DPA-2 descriptor includes two primary components: `repinit` and `repformer`. (The detailed architecture is shown in a figure in the original release notes.)
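As a minimal sketch, training the DPA-2 water example with the PyTorch backend looks like the following (assuming the example input file is named `input_torch.json`, in line with the other PyTorch examples):

```sh
# train the DPA-2 water example on the PyTorch backend
cd examples/water/dpa2
dp --pt train input_torch.json
```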
Training strategies for large atomic models
The PyTorch backend supports multiple training strategies for developing large atomic models.
- Parallel training: large atomic models have many hyper-parameters and a complex architecture, so training on multiple GPUs is necessary. Benefiting from the PyTorch community ecosystem, parallel training for the PyTorch backend can be driven by `torchrun`, a launcher for distributed data parallel training (sketched after this list).
- Multi-task training: large atomic models are trained against data in a wide scope and at different DFT levels, which requires multi-task training. The PyTorch backend supports multi-task training, sharing the descriptor between different tasks. An example is given in `examples/water_multi_task/pytorch_example/input_torch.json`.
- Fine-tuning: fine-tuning is useful for training a pre-trained large model on a smaller, task-specific dataset. The PyTorch backend supports the `--finetune` argument in the `dp --pt train` command line (sketched after this list).
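A minimal sketch of these strategies, assuming a hypothetical input file `input.json` and a hypothetical pre-trained checkpoint `pretrained.pt`:

```sh
# parallel training: launch distributed data parallel training via torchrun
# (--no-python is needed because dp is a console script, not a .py file)
torchrun --nproc_per_node=4 --no-python dp --pt train input.json

# multi-task training: the strategy is configured in the input file itself
dp --pt train examples/water_multi_task/pytorch_example/input_torch.json

# fine-tuning: continue training from a pre-trained large model
dp --pt train input.json --finetune pretrained.pt
```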
Developing new models using Python and dynamic graphs
The static graph and the custom C++ OPs of the TensorFlow backend can be painful for researchers, sacrificing research convenience for computational performance. The PyTorch backend has a well-designed code structure built on the dynamic graph and is currently written entirely in Python, which makes extending and debugging new deep potential models easier than with a static graph.
Supporting traditional deep potential models
People may still want to use, in the PyTorch backend, the traditional models already supported by the TensorFlow backend, and to compare the same model among different backends. We have rewritten almost all of the traditional models in the PyTorch backend; their status is listed below:

|  | Descriptors | Fitting | Models |
| --- | --- | --- | --- |
| Supported | `se_e2_a`, `se_e2_r`, `se_atten`, `hybrid` |  | `standard`, DPRc |
| Not supported yet | `se_e3`, `se_atten_v2`, `se_e2_a_mask` | `dos` | `linear_ener`, DPLR, `pairtab`, `frozen`, `pairwise_dprc`, ZBL, Spin |
| Not planned to support | `loc_frame`, `se_e2_a` + type embedding, `se_a_ebd_v2` |  |  |
Warning
As part of an alpha release, the PyTorch backend's API or user input arguments may change before the first stable version.
DP backend and format: reference backend for other backends
DP is a reference backend for development that uses pure NumPy to implement models, without any heavy deep-learning framework. It cannot be used for training, only for Python inference. As a reference backend, it aims not at the best performance but at correct results. The DP backend uses HDF5 to store model serialization data, which is backend-independent.
The DP backend and the serialization data are used in the unit tests to ensure that different backends produce consistent results and that models can be converted between them.
In the current version, the DP backend has a support status similar to the PyTorch backend, although DPA-1 and DPA-2 are not supported yet.
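As a minimal sketch of backend-independent Python inference (the model file name is hypothetical; the backend is selected by the extension, e.g. `.pb` for TensorFlow, `.pth` for PyTorch, `.hdf5` for the DP backend):

```py
import numpy as np
from deepmd.infer import DeepPot  # universal inference interface

# the backend is inferred from the model file extension
dp = DeepPot("frozen_model.hdf5")

# one frame of a water molecule: coordinates flattened to (nframes, natoms * 3)
coord = np.array([[0.0, 0.0, 0.0, 0.0, 0.0, 0.96, 0.93, 0.0, -0.24]])
cell = 10.0 * np.eye(3).reshape(1, 9)  # a 10 Å cubic box
atype = [0, 1, 1]  # atom types following the model's type map, e.g. O, H, H

e, f, v = dp.eval(coord, cell, atype)  # energy, forces, virial
```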
Authors
The above highlights were mainly contributed by the authors of the following pull requests:
- `deepmd_utils` #3173
- add dpdata driver #3174
- docs: rewrite README; deprecate manually written TOC #3179
- docs: document PyTorch backend #3193
- allow disabling TensorFlow backend during Python installation #3200
- pt: add tensorboard and profiler support #3204
- pt: set nthreads from env #3205
- docs: install pytorch in RTD #3333
- docs: dpmodel, model conversion #3360
- docs: apply type_one_side=True to `se_a` and `se_r` #3364
- Hybrid descriptor #3365
- setup PyTorch C++ interface build environement #3169
- add universal Python inference interface DeepPot #3164
- add more abstractmethods to universal `DeepPot` #3175
- cc: reimplement read_file_to_string without calling TensorFlow #3176
- Merge TF and PT CLI #3187
- Fix PT `DeepPot` and replace ASE calculator #3186
- PT: keep the same checkpoint behavior as TF #3191
- add size and replace arguments to deepmd.utils.random.choice #3195
- drop tqdm #3194
- reorganize tests directory #3196
- fix: install CU11 PyTorch in the CU11 docker image #3198
- throw errors when PyTorch CXX11 ABI is different from TensorFlow #3201
- fix GPU test OOM problem #3207
- pt: rename atomic_virial to atom_virial in the model output #3226
- pt: apply global logger to pt #3222
- pt: apply global user set precision to pt #3220
- fix DP_ENABLE_TENSORFLOW support #3229
- drop `deepmd.tf.cluster.slurm` #3239
- add category property to OutputVariableDef #3228
- issue template: change TF version to backend version #3244
- refactor print summary #3243
- refactor DeepEval #3213
- backend-indepedent dp test #3249
- pt: infer model type from ModelOutputDef #3250
- tf: support checkpoint path (instead of directory) in dp freeze #3254
- add `get_type_map` method to model; export model methods #3247
- pt: support loading frozen models in DeepEval #3253
- add neighbor stat support with NumPy and PyTorch implementation #3271
- support TF se_e2_a serialization; add a common test fixture to compare TF, PT, and DP models #3263
- pt: add exported methods to BaseAtomicModel #3258
- pt: fix torchscript converage #3276
- pt: refactor data stat #3285
- consistent energy fitting #3286
- dp&pt: let DPAtomicModel fetch attributes from Fitting #3292
- pluggable backend #3294
- pt: process frames in parallel for env mat stat #3293
- pt: avoid `set_default_dtype` in tests #3303
- fix neighbor stat mixed_types input #3304
- allow both absulute and relative tolerance when testing consistency #3308
- pt: explicitly set device #3307
- consistent energy model #3306
- merge common subcommands in cli #3316
- pt: fix se_e2_a precision cast #3315
- pt: export `model_output_type` instead of `model_output_def` #3318
- feat: convert model files between backends #3323
- store type in descriptor serialization data #3325
- feat: add NumPy DeepPot #3332
- store type in fitting serialization data #3331
- pt: remove env.DEVICE in all `forward` functions #3330
- feat(pt/dpmodel): support type_one_side in se_e2_a #3339
- add BaseModel; store type in serialization #3335
- apply PluginVariant and make_plugin_registry to classes #3346
- add @version to serialization data #3349
- pt: support `--init-frz-model` #3350
- merge compute_output_stat #3310
- feat(pt): support fparam/aparam in DeepEval #3356
- pt: fix se_a type_one_side performance degradation #3361
- pt: apply argcheck to pt #3342
- feat: update sel by statistics #3348
- feat(pt): support fparam/aparam in C++ DeepPot #3358
- fix se_r consistency #3366
- sync descriptor alias #3374
- feat: atom_ener in energy fitting #3370
- docs: DPRc for PT, DPModel #3373
- pt: supprot `--output` in `dp train` #3377
- fix: prevent deepmd.tf be imported globally #3382
- pt: print data summary #3383
- pt: expand systems before training #3384
- pt: add fparam/aparam data requirements #3386
- breaking: change DeepTensor output dim from nsel_atoms to natoms #3390
- pt: ban `torch.testing.assert_allclose` #3395
- allow loading either nsel or natoms atomic tensor data #3394
- do not return g2, h2, sw in hybrid descriptors #3396
- format training logging #3397
- `change_energy_bias` and fix finetune #3378
Breaking changes
- … `.pb` extension.
- `DeepTensor` (including `DeepDipole` and `DeepPolar`) now returns the atomic tensor in the dimension of `natoms` instead of `nsel_atoms`. by @njzjz in breaking: change DeepTensor output dim from nsel_atoms to natoms #3390
- The `deepmd` module was moved to `deepmd.tf` without other API changes, and `deepmd_utils` was moved to `deepmd` without other API changes. by @njzjz in breaking: move deepmd to deepmd.tf #3177 and breaking: move deepmd_utils to deepmd #3178
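A minimal sketch of what the module move means for imports (hedged: the exact class paths are inferred from the PRs above):

```py
# before v3, the TensorFlow implementation lived in the top-level deepmd
# package; in v3 it moved to deepmd.tf, while the backend-independent code
# formerly in deepmd_utils now lives under deepmd
from deepmd.tf.infer import DeepPot as TFDeepPot  # TensorFlow-backend class
from deepmd.infer import DeepPot                  # backend-independent class
```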
Other changes
Enhancement
CI/CD
Bugfix
New Contributors
Full Changelog: v2.2.8...v3.0.0a0
This discussion was created from the release v3.0.0a0.