Skip to main content

A benchmark for machine learning energy models on inorganic crystal stability prediction from unrelaxed structures

Project description

Logo Matbench Discovery

Tests GitHub Pages pre-commit.ci status Requires Python 3.9+ PyPI

TL;DR: We benchmark ML models on crystal stability prediction from unrelaxed structures finding interatomic potentials in particular to be a valuable addition to high-throughput discovery pipelines.

Matbench Discovery is an interactive leaderboard and associated PyPI package which together make it easy to benchmark ML energy models on a task designed to closely simulate a high-throughput discovery campaign for new stable inorganic crystals.

In version 1 of this benchmark, we explore 8 models covering multiple methodologies ranging from random forests to graph neural networks, from one-shot predictors to iterative Bayesian optimizers and interatomic potential-based relaxers. We find M3GNet (paper) to achieve the highest F1 score of 0.58 and $R^2$ of 0.59 while MEGNet (paper) wins on discovery acceleration factor (DAF) with 2.94. See the full results in our interactive dashboard which provides valuable insights for maintainers of large-scale materials databases. We show these models have become powerful enough to warrant deploying them as triaging steps to more effectively allocate compute in high-throughput DFT relaxations.

We welcome contributions that add new models to the leaderboard through GitHub PRs. See the usage and contributing guide for details.

For a version 2 release of this benchmark, we plan to merge the current training and test sets into the new training set and acquire a much larger test set compared to the v1 test set of 257k structures.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

matbench-discovery-0.1.1.tar.gz (34.0 kB view details)

Uploaded Source

File details

Details for the file matbench-discovery-0.1.1.tar.gz.

File metadata

  • Download URL: matbench-discovery-0.1.1.tar.gz
  • Upload date:
  • Size: 34.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.0

File hashes

Hashes for matbench-discovery-0.1.1.tar.gz
Algorithm Hash digest
SHA256 4de3cc0e234a028fcf14a47225064e1596649ca27ba6ab2928ba74ee2c0f4513
MD5 00ef2fed56684d6313c1e434f125edef
BLAKE2b-256 cdacc4937a8ad9cc4b479bf02e3bc409a7da5f1e74a57880b941bf67699daf91

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page