Skip to main content

MLgym, a python framework for distributed machine learning model training in research.

Project description

mlGym_logo

MLgym

a python framework for distributed machine learning model training in research.

CircleCI

At its core, MLgym offers functionality to run gridsearches of Pytorch models at scale split over multiple GPUs and centrally store the results using DashifyML.

Futhermore, MLgym provides the following key features:

  • Reproducibility of results due to full experiment specification including dataset, preprocessing routines, model architecture, loss function, metrics and more within a single YAML config.

  • Component registry to register custom components with dependencies. For instance one can define a new preprocessing routine component. This component may depend on an iterator component, as specified in the experiment config. During runtime these components are instantiated on the fly.

  • Resume training after crash

  • Custom training routines, e.g., training with partially frozen network weights

  • Large scale, multi GPU training supporting Grid Search, Nested Cross Validation, Cross Validation

  • Reduced logging to reduce storage footprint of model and optimizer states

Please note, that at the moment this code should be treated as experimental and is not production ready.

Install

there are two options to install MLgym, the easiest way is to install it from the pip repository:

pip install mlgym

For the latest version, one can directly install it from source by cd into the root folder and then running

pip install src/

Usage

NOTE: This framework is still under heavy development and mainly used in research projects. It's most likely not free of bugs and interfaces can still change.

For usage see this example.

Copyright

Copyright (c) 2020 Max Lübbering, Rajkumar Ramamurthy

For license see: https://github.com/le1nux/mlgym/blob/master/LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlgym-0.0.53.tar.gz (43.5 kB view details)

Uploaded Source

Built Distribution

mlgym-0.0.53-py3-none-any.whl (62.9 kB view details)

Uploaded Python 3

File details

Details for the file mlgym-0.0.53.tar.gz.

File metadata

  • Download URL: mlgym-0.0.53.tar.gz
  • Upload date:
  • Size: 43.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for mlgym-0.0.53.tar.gz
Algorithm Hash digest
SHA256 d30fbcbd2f8e833301364a2ca468982ec59693c92d562b523df209230819134d
MD5 ab9ca8d6f623857665ef6a4219c8dd57
BLAKE2b-256 8bdf1e1b3c964faa5bd65508ea40f193cf119a2eaa39e6dfdaa43451fbb19f27

See more details on using hashes here.

File details

Details for the file mlgym-0.0.53-py3-none-any.whl.

File metadata

  • Download URL: mlgym-0.0.53-py3-none-any.whl
  • Upload date:
  • Size: 62.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.63.0 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for mlgym-0.0.53-py3-none-any.whl
Algorithm Hash digest
SHA256 a7c6ea628864e57bfeef3c758b6a41f6160c99d439ef1e26c483e0daf3b6fb9a
MD5 b02ab7c1eb9ce67133bd15f9a221c16f
BLAKE2b-256 3c4dcace5d0523706ca190f4dbd898bbf7738ca83ab3565047b7ec1be8d8aa1d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page