A benchmark for machine learning energy models on inorganic crystal stability prediction from unrelaxed structures
Project description
Matbench Discovery
TL;DR: We benchmark ML models on crystal stability prediction from unrelaxed structures finding universal interatomic potentials (UIP) like CHGNet, M3GNet and MACE to be highly accurate, robust across chemistries and ready for production use in high-throughput discovery pipelines.
Matbench Discovery is an interactive leaderboard and associated PyPI package which together make it easy to rank ML energy models on a task designed to closely simulate a high-throughput discovery campaign for new stable inorganic crystals.
So far, we've tested 8 models covering multiple methodologies ranging from random forests with structure fingerprints to graph neural networks, from one-shot predictors to iterative Bayesian optimizers and interatomic potential relaxers. We find CHGNet (paper) to achieve the highest F1 score of 0.59, $R^2$ of 0.61 and a discovery acceleration factor (DAF) of 3.06 (meaning a 3x higher rate of stable structures compared to dummy selection in our already enriched search space). We believe our results show that ML models have become robust enough to deploy them as triaging steps to more effectively allocate compute in high-throughput DFT relaxations. This work provides valuable insights for anyone looking to build large-scale materials databases.
We welcome contributions that add new models to the leaderboard through GitHub PRs. See the contributing guide for details.
If you're interested in joining this work, feel free to open a GitHub discussion or send an email.
For detailed results and analysis, check out the preprint.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file matbench-discovery-1.0.0.tar.gz
.
File metadata
- Download URL: matbench-discovery-1.0.0.tar.gz
- Upload date:
- Size: 40.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7c1b303af471788458a7ea3918d8951450d30b1f638a79852c04151c62f48b94 |
|
MD5 | 11cc5c96d3e5553e8fd995dd083e1f71 |
|
BLAKE2b-256 | 494764a4a5a40afa21d0eb0b285555b62c67cf28f45073f1780a91b700c80ea4 |
File details
Details for the file matbench_discovery-1.0.0-py2.py3-none-any.whl
.
File metadata
- Download URL: matbench_discovery-1.0.0-py2.py3-none-any.whl
- Upload date:
- Size: 32.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 29ad3f7841a2322998d3d7db060b8a5c460109865b6b25e41ffe6e2051f8fdda |
|
MD5 | 986a23eb1d8bf7c24008ff5181c37166 |
|
BLAKE2b-256 | f10bedd2476dd3ad592d3caceb6976f5aae13156af13aeb46b233ba2428ea8fd |