Skip to main content

High Level Expressions for Dask

Project description

Dask Expressions

Dask DataFrames with query optimization.

This is a rewrite of Dask DataFrame that includes query optimization and generally improved organization.

More in our blog posts:

Example

import dask_expr as dx

df = dx.datasets.timeseries()
df.head()

df.groupby("name").x.mean().compute()

Query Representation

Dask-expr encodes user code in an expression tree:

>>> df.x.mean().pprint()

Mean:
  Projection: columns='x'
    Timeseries: seed=1896674884

This expression tree will be optimized and modified before execution:

>>> df.x.mean().optimize().pprint()

Div:
  Sum:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
  Count:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884

Stability

This is the default backend for dask.DataFrame since version 2024.3.0.

API Coverage

Dask-Expr covers almost everything of the Dask DataFrame API. The only missing features are:

  • named GroupBy Aggregations

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dask_expr-1.1.4.tar.gz (189.2 kB view details)

Uploaded Source

Built Distribution

dask_expr-1.1.4-py3-none-any.whl (205.9 kB view details)

Uploaded Python 3

File details

Details for the file dask_expr-1.1.4.tar.gz.

File metadata

  • Download URL: dask_expr-1.1.4.tar.gz
  • Upload date:
  • Size: 189.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for dask_expr-1.1.4.tar.gz
Algorithm Hash digest
SHA256 751409a304e697fbd920e3035bdc9af2111194dff5e699bfb25a193286f1e606
MD5 1627bbbb0c1972d2a04f22c450f383cd
BLAKE2b-256 e501989d0e81d61a145564c2ce6622f1a6f5a1e8f3f77f7b7452f4b08c0c6720

See more details on using hashes here.

Provenance

File details

Details for the file dask_expr-1.1.4-py3-none-any.whl.

File metadata

  • Download URL: dask_expr-1.1.4-py3-none-any.whl
  • Upload date:
  • Size: 205.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for dask_expr-1.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 f517098b650d622b06363848dfcccf833ead7ec2c4c67c0831aa7e5db23347ff
MD5 bba9dd57f41deeeb1efb85601c30fa0e
BLAKE2b-256 d80666bbb08d6435735e66dbbf25d72fe16562dca64a98e76e16d6e066edf08f

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page