Skip to main content

A benchmark for machine learning energy models on inorganic crystal stability prediction from unrelaxed structures

Project description

Logo
Matbench Discovery

arXiv Tests GitHub Pages Requires Python 3.9+ PyPI

TL;DR: We benchmark ML models on crystal stability prediction from unrelaxed structures finding universal interatomic potentials (UIP) like CHGNet, M3GNet and MACE to be highly accurate, robust across chemistries and ready for production use in high-throughput discovery pipelines.

Matbench Discovery is an interactive leaderboard and associated PyPI package which together make it easy to rank ML energy models on a task designed to closely simulate a high-throughput discovery campaign for new stable inorganic crystals.

So far, we've tested 8 models covering multiple methodologies ranging from random forests with structure fingerprints to graph neural networks, from one-shot predictors to iterative Bayesian optimizers and interatomic potential relaxers. We find CHGNet (paper) to achieve the highest F1 score of 0.59, $R^2$ of 0.61 and a discovery acceleration factor (DAF) of 3.06 (meaning a 3x higher rate of stable structures compared to dummy selection in our already enriched search space). We believe our results show that ML models have become robust enough to deploy them as triaging steps to more effectively allocate compute in high-throughput DFT relaxations. This work provides valuable insights for anyone looking to build large-scale materials databases.

We welcome contributions that add new models to the leaderboard through GitHub PRs. See the contributing guide for details.

If you're interested in joining this work, feel free to open a GitHub discussion or send an email.

For detailed results and analysis, check out the preprint.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

matbench-discovery-1.0.0.tar.gz (40.3 kB view details)

Uploaded Source

Built Distribution

matbench_discovery-1.0.0-py2.py3-none-any.whl (32.1 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file matbench-discovery-1.0.0.tar.gz.

File metadata

  • Download URL: matbench-discovery-1.0.0.tar.gz
  • Upload date:
  • Size: 40.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for matbench-discovery-1.0.0.tar.gz
Algorithm Hash digest
SHA256 7c1b303af471788458a7ea3918d8951450d30b1f638a79852c04151c62f48b94
MD5 11cc5c96d3e5553e8fd995dd083e1f71
BLAKE2b-256 494764a4a5a40afa21d0eb0b285555b62c67cf28f45073f1780a91b700c80ea4

See more details on using hashes here.

File details

Details for the file matbench_discovery-1.0.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for matbench_discovery-1.0.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 29ad3f7841a2322998d3d7db060b8a5c460109865b6b25e41ffe6e2051f8fdda
MD5 986a23eb1d8bf7c24008ff5181c37166
BLAKE2b-256 f10bedd2476dd3ad592d3caceb6976f5aae13156af13aeb46b233ba2428ea8fd

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page