
High Level Expressions for Dask


Dask Expressions

Dask DataFrames with query optimization.

This is a rewrite of Dask DataFrame that includes query optimization and generally improved organization.

More in our blog posts:

Example

import dask_expr as dx

df = dx.datasets.timeseries()
df.head()

df.groupby("name").x.mean().compute()

Query Representation

Dask-expr encodes user code in an expression tree:

>>> df.x.mean().pprint()

Mean:
  Projection: columns='x'
    Timeseries: seed=1896674884

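Before execution, the optimizer rewrites this expression tree. In the output below, the mean becomes a sum divided by a count, and the Timeseries source is narrowed to column x: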

>>> df.x.mean().optimize().pprint()

Div:
  Sum:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
  Count:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
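The rewrite from Mean to Div(Sum, Count) shown above can be illustrated with a small toy model. This is only a sketch of the general idea of an expression tree with one rewrite rule, not dask-expr's implementation: the classes Source, Mean, Sum, Count and Div and the function optimize are hypothetical names invented for this example, and the data is a plain in-memory tuple rather than a partitioned DataFrame.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Expr:
    """Base class for nodes of this toy expression tree (hypothetical)."""

    def children(self):
        return ()

    def label(self):
        return type(self).__name__ + ":"

    def tree_repr(self, indent=0):
        # Render the node, then each child indented by two more spaces.
        lines = [" " * indent + self.label()]
        for child in self.children():
            lines.append(child.tree_repr(indent + 2))
        return "\n".join(lines)


@dataclass(frozen=True)
class Source(Expr):
    data: tuple  # stand-in for a column of numbers

    def label(self):
        return f"Source: n={len(self.data)}"

    def evaluate(self):
        return self.data


@dataclass(frozen=True)
class Unary(Expr):
    child: Expr

    def children(self):
        return (self.child,)


class Mean(Unary):
    def evaluate(self):
        values = self.child.evaluate()
        return sum(values) / len(values)


class Sum(Unary):
    def evaluate(self):
        return sum(self.child.evaluate())


class Count(Unary):
    def evaluate(self):
        return len(self.child.evaluate())


@dataclass(frozen=True)
class Div(Expr):
    left: Expr
    right: Expr

    def children(self):
        return (self.left, self.right)

    def evaluate(self):
        return self.left.evaluate() / self.right.evaluate()


def optimize(expr):
    """Rewrite every Mean node into Div(Sum, Count); keep other nodes."""
    if isinstance(expr, Source):
        return expr
    if isinstance(expr, Mean):
        child = optimize(expr.child)
        return Div(Sum(child), Count(child))
    if isinstance(expr, Unary):
        return type(expr)(optimize(expr.child))
    if isinstance(expr, Div):
        return Div(optimize(expr.left), optimize(expr.right))
    raise TypeError(f"unknown expression type: {type(expr).__name__}")


source = Source((1.0, 2.0, 3.0, 6.0))
query = Mean(source)
plan = optimize(query)

print(query.tree_repr())
print(plan.tree_repr())
```

Because a sum and a count can each be computed per partition and then combined, a rewrite of this shape is what makes a mean easy to parallelize. In the toy, the original and rewritten trees evaluate to the same value.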

Stability

Dask-expr has been the default backend for dask.dataframe since Dask version 2024.3.0.

API Coverage

Dask-expr covers almost all of the Dask DataFrame API. The only missing features are:

  • melt
  • named GroupBy Aggregations


