Skip to main content

High Level Expressions for Dask

Project description

Dask Expressions

Dask DataFrames with query optimization.

This is a rewrite of Dask DataFrame that includes query optimization and generally improved organization.

More in our blog posts:

Example

import dask_expr as dx

df = dx.datasets.timeseries()
df.head()

df.groupby("name").x.mean().compute()

Query Representation

Dask-expr encodes user code in an expression tree:

>>> df.x.mean().pprint()

Mean:
  Projection: columns='x'
    Timeseries: seed=1896674884

This expression tree will be optimized and modified before execution:

>>> df.x.mean().optimize().pprint()

Div:
  Sum:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
  Count:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884

Stability

This is the default backend for dask.DataFrame since version 2024.3.0.

API Coverage

Dask-Expr covers almost everything of the Dask DataFrame API. The only missing features are:

  • named GroupBy Aggregations

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dask_expr-1.1.2.tar.gz (188.7 kB view details)

Uploaded Source

Built Distribution

dask_expr-1.1.2-py3-none-any.whl (205.5 kB view details)

Uploaded Python 3

File details

Details for the file dask_expr-1.1.2.tar.gz.

File metadata

  • Download URL: dask_expr-1.1.2.tar.gz
  • Upload date:
  • Size: 188.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.8

File hashes

Hashes for dask_expr-1.1.2.tar.gz
Algorithm Hash digest
SHA256 ce2e3803b638cdc67bc75326e1b0d36ea9d231fdddf086e727145a5a2769bed4
MD5 9d193032afebe401166e23e19096dc17
BLAKE2b-256 c2b7da1deafb2ba44c6e66dd45854405400f40d3f692aa4a26429391bbfee978

See more details on using hashes here.

Provenance

File details

Details for the file dask_expr-1.1.2-py3-none-any.whl.

File metadata

  • Download URL: dask_expr-1.1.2-py3-none-any.whl
  • Upload date:
  • Size: 205.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.8

File hashes

Hashes for dask_expr-1.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3be69fb2d449b5edf4404e953b7f6e688426872c6eb10f239539ead716a06f7a
MD5 2cf52a736c69ff610502c0a8d20a5656
BLAKE2b-256 056ee057efb05b9be2970334dbea09ef8fd18f585d24206b4cb15a07aca18375

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page