Add ByteFF-GNN and ByteFF-Pol v2; keep legacy v1 at trained_models/, valid_data/
#3
by xux1ngyuan - opened
- ByteFF-GNN/example_data/hessian_example.csv +0 -0
- ByteFF-GNN/example_data/hessian_example.h5 +3 -0
- ByteFF-GNN/example_data/torsion_example.csv +0 -0
- ByteFF-GNN/example_data/torsion_example.h5 +3 -0
- ByteFF-Pol/trained_models_v2/fftrainer_config_in_use.yaml +67 -0
- ByteFF-Pol/trained_models_v2/optimal.pt +3 -0
- README.md +29 -12
- valid_data/dataset_config.yaml +2 -3
- valid_data/processed_data_shard0.pkl +2 -2
ByteFF-GNN/example_data/hessian_example.csv
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
ByteFF-GNN/example_data/hessian_example.h5
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2181c999d064e856f342e69af9a8263158e5b8fcda42793817d43f90bd2d7575
|
| 3 |
+
size 48510784
|
ByteFF-GNN/example_data/torsion_example.csv
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
ByteFF-GNN/example_data/torsion_example.h5
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:265907c1aca2f4e67f2777ccc99268a6d4ea9fe11513fa4c6a4de5a2879c082e
|
| 3 |
+
size 66131920
|
ByteFF-Pol/trained_models_v2/fftrainer_config_in_use.yaml
ADDED
|
@@ -0,0 +1,67 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
model:
|
| 2 |
+
ff_block:
|
| 3 |
+
- act: gelu
|
| 4 |
+
out_mlp_dims:
|
| 5 |
+
- 256
|
| 6 |
+
- 4
|
| 7 |
+
post_mlp_dims:
|
| 8 |
+
- 256
|
| 9 |
+
- 256
|
| 10 |
+
- 1
|
| 11 |
+
pre_mlp_dims:
|
| 12 |
+
- 256
|
| 13 |
+
- 256
|
| 14 |
+
- 3
|
| 15 |
+
tanh_output: 20.0
|
| 16 |
+
type: MMBondedConj
|
| 17 |
+
- act: gelu
|
| 18 |
+
out_mlp_dims:
|
| 19 |
+
- 256
|
| 20 |
+
- 3
|
| 21 |
+
pre_mlp_dims:
|
| 22 |
+
- 256
|
| 23 |
+
- 256
|
| 24 |
+
- 3
|
| 25 |
+
type: ChargeVolume
|
| 26 |
+
- act: gelu
|
| 27 |
+
charge14: 0.5
|
| 28 |
+
disp_damping_factor: 120.0
|
| 29 |
+
fix_li_alpha: 1.0e-06
|
| 30 |
+
ind14: 0.5
|
| 31 |
+
li_damp_clip: 0.2
|
| 32 |
+
out_mlp_dims:
|
| 33 |
+
- 256
|
| 34 |
+
- 3
|
| 35 |
+
pol_damping_factor: 0.39
|
| 36 |
+
pre_mlp_dims:
|
| 37 |
+
- 256
|
| 38 |
+
- 256
|
| 39 |
+
- 3
|
| 40 |
+
s12: 1.5
|
| 41 |
+
type: Exp6Pol
|
| 42 |
+
vdw14: 0.5
|
| 43 |
+
graph_block:
|
| 44 |
+
feature_layer:
|
| 45 |
+
act: gelu
|
| 46 |
+
atom_embedding_dim: 32
|
| 47 |
+
edge_mlp_dims:
|
| 48 |
+
- 64
|
| 49 |
+
- 256
|
| 50 |
+
- 2
|
| 51 |
+
node_mlp_dims:
|
| 52 |
+
- 64
|
| 53 |
+
- 256
|
| 54 |
+
- 2
|
| 55 |
+
gnn_layer:
|
| 56 |
+
act: gelu
|
| 57 |
+
at_channels: 16
|
| 58 |
+
ffn_dims:
|
| 59 |
+
- 256
|
| 60 |
+
- 2
|
| 61 |
+
gnn_dims:
|
| 62 |
+
- 256
|
| 63 |
+
- 256
|
| 64 |
+
- 4
|
| 65 |
+
gnn_type: EGT
|
| 66 |
+
heads: 16
|
| 67 |
+
jk: cat
|
ByteFF-Pol/trained_models_v2/optimal.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:04d6d1472e75d2a514d29fb5b2ee4d2a96a259c5154fdfb30a1b13bd70968296
|
| 3 |
+
size 82200474
|
README.md
CHANGED
|
@@ -6,28 +6,45 @@ tags:
|
|
| 6 |
---
|
| 7 |
# ByteFF2
|
| 8 |
|
| 9 |
-
This repository contains the model
|
| 10 |
|
| 11 |
-
[ByteFF-
|
|
|
|
|
|
|
| 12 |
|
| 13 |
# Trained Models
|
| 14 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
|
| 16 |
# How to use
|
| 17 |
Code and examples are available in the [byteff2](https://github.com/ByteDance-Seed/byteff2) repository.
|
| 18 |
|
| 19 |
## Citation
|
| 20 |
-
If you find ByteFF-Pol is useful for your research and applications, feel free to give us a star ⭐ or cite us using:
|
| 21 |
|
| 22 |
```bibtex
|
| 23 |
|
| 24 |
-
@
|
| 25 |
-
title
|
| 26 |
-
author
|
| 27 |
-
year
|
| 28 |
-
|
| 29 |
-
|
| 30 |
-
|
| 31 |
-
url = {https://arxiv.org/abs/2508.08575}
|
| 32 |
}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
```
|
|
|
|
| 6 |
---
|
| 7 |
# ByteFF2
|
| 8 |
|
| 9 |
+
This repository contains the model and example data used for ByteFF-GNN and ByteFF-Pol.
|
| 10 |
|
| 11 |
+
* [ByteFF-GNN](https://pubs.rsc.org/en/content/articlehtml/2025/sc/d4sc06640e) is a molecular mechanics force field parameterized by an edge-augmented, symmetry-preserving graph neural network (GNN), trained on large-scale high-level quantum mechanics (QM) data. ByteFF-GNN enables fast, one-pass prediction of bonded and non-bonded parameters for drug-like molecules, achieving SOTA accuracy across torsional energy profiles, relaxed geometries, conformational energies, and off-equilibrium energies and forces.
|
| 12 |
+
|
| 13 |
+
* [ByteFF-Pol](https://www.nature.com/articles/s41467-026-73566-3) is a polarizable force field parameterized by a graph neural network (GNN), trained on high-level quantum mechanics (QM) data, thus eliminating the need for experimental calibration. ByteFF-Pol achieves exceptional accuracy in predicting the thermodynamic and transport properties of small-molecule liquids and electrolytes, outperforming SOTA traditional and ML force fields.
|
| 14 |
|
| 15 |
# Trained Models
|
| 16 |
+
* **ByteFF-Pol**: Two model versions are provided:
|
| 17 |
+
- `ByteFF-Pol/trained_models_v2/` — latest model (**recommended**).
|
| 18 |
+
- `trained_models/` — legacy v1 model, kept at the original path for backward compatibility (paired with `valid_data/`).
|
| 19 |
+
* **ByteFF-GNN**: Example training data in `ByteFF-GNN/example_data/`.
|
| 20 |
+
|
| 21 |
|
| 22 |
# How to use
|
| 23 |
Code and examples are available in the [byteff2](https://github.com/ByteDance-Seed/byteff2) repository.
|
| 24 |
|
| 25 |
## Citation
|
| 26 |
+
If you find ByteFF-Pol or ByteFF-GNN is useful for your research and applications, feel free to give us a star ⭐ or cite us using:
|
| 27 |
|
| 28 |
```bibtex
|
| 29 |
|
| 30 |
+
@article{zheng2026bridging,
|
| 31 |
+
title = {Bridging quantum mechanics to liquid properties via a universal organic force field},
|
| 32 |
+
author = {Tianze Zheng and Xingyuan Xu and Zhi Wang and Zhenze Yang and Yuanheng Wang and Xu Han and Lei Chen and Zhenliang Mu and Ziqing Zhang and Siyuan Liu and Sheng Gong and Kuang Yu and Wen Yan},
|
| 33 |
+
year = {2026},
|
| 34 |
+
journal = {Nature Communications},
|
| 35 |
+
doi = {10.1038/s41467-026-73566-3},
|
| 36 |
+
url = {https://www.nature.com/articles/s41467-026-73566-3}
|
|
|
|
| 37 |
}
|
| 38 |
+
|
| 39 |
+
@Article{D4SC06640E,
|
| 40 |
+
author = {Tianze Zheng and Ailun Wang and Xu Han and Yu Xia and Xingyuan Xu and Jiawei Zhan and Yu Liu and Yang Chen and Zhi Wang and Xiaojie Wu and Sheng Gong and Wen Yan},
|
| 41 |
+
title = {Data-driven parametrization of molecular mechanics force fields for expansive chemical space coverage},
|
| 42 |
+
journal = {Chem. Sci.},
|
| 43 |
+
year = {2025},
|
| 44 |
+
pages = {-},
|
| 45 |
+
publisher = {The Royal Society of Chemistry},
|
| 46 |
+
doi = {10.1039/D4SC06640E},
|
| 47 |
+
url = {http://dx.doi.org/10.1039/D4SC06640E}
|
| 48 |
+
}
|
| 49 |
+
|
| 50 |
```
|
valid_data/dataset_config.yaml
CHANGED
|
@@ -4,8 +4,7 @@ confdata:
|
|
| 4 |
data_cls: ClusterData
|
| 5 |
kwargs:
|
| 6 |
max_n_confs: 20
|
| 7 |
-
meta_fp: /mnt/bn/byteff-dev-large/users/zhengtianze/byteff_v2/meta.txt
|
| 8 |
moldata: {}
|
| 9 |
-
save_dir: /
|
| 10 |
shards: 1
|
| 11 |
-
timestamp: '
|
|
|
|
| 4 |
data_cls: ClusterData
|
| 5 |
kwargs:
|
| 6 |
max_n_confs: 20
|
|
|
|
| 7 |
moldata: {}
|
| 8 |
+
save_dir: ../../../byteff2/valid_data
|
| 9 |
shards: 1
|
| 10 |
+
timestamp: '26_06_16_13_23_12'
|
valid_data/processed_data_shard0.pkl
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c316628a2185cd029542cebd40dcb5754ab0bb2bd841060a7b7c4140a0bd5e7f
|
| 3 |
+
size 111042000
|