Skip to main content

Azure ML Data Preparation SDK

Reason this release was yanked:

This version added a dependency that does not support Windows Python 3.5 environments.

Project description

Azure Machine Learning Data Prep SDK for Python

The Azure Machine Learning Data Prep SDK is used to load, transform, and write data for machine learning workflows. You can interact with the SDK in any Python environment, including Jupyter Notebooks or your favorite Python IDE.

This python SDK includes the following functionality to help prepare your data for modeling:

  • Intelligent time-saving transformations such as:
  • Automatic file type detection. The SDK can automatically detect any of the supported file types. You don’t need to use special file readers for CSV, text, Excel, etc., or to specify delimiter, header, or encoding parameters.
  • Cross-platform functionality with a single code artifact. The SDK also allows for dataflow objects to be serialized and opened in any Python environment. Write to a single SDK and run it on Windows, macOS, Linux, or Spark in a scale-up or scale-out manner. When running in scale-up, the engine attempts to utilize all hardware threads available, when running scale-out the engine allows the distributed scheduler to optimize execution.
  • Summary statistics can be generated quickly for a dataflow with a single line of code.
  • Scale through streaming. Instead of loading all the data into memory, the SDK engine streams data for better scale and performance on large datasets.

Install the SDK

To install the Azure Machine Learning Data Prep SDK for Python, use the following command:

pip install --upgrade azureml-dataprep

Learn how to use it

Here are some resources to help you learn more about the Azure Machine Learning Data Prep SDK for Python:

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

azureml_dataprep-1.10.0-py3-none-any.whl (28.1 MB view details)

Uploaded Python 3

File details

Details for the file azureml_dataprep-1.10.0-py3-none-any.whl.

File metadata

  • Download URL: azureml_dataprep-1.10.0-py3-none-any.whl
  • Upload date:
  • Size: 28.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.7.6

File hashes

Hashes for azureml_dataprep-1.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 decef3bbf0ab20e4eebcf4d47933045a73bce032b2678d982781396779ff6d60
MD5 71ba494c07cee5fd496f91231c3ab84a
BLAKE2b-256 28affab8259b895501c3397cb27f6be760d36f753db5166f1252ff6b4a9b7ece

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page