High Level Expressions for Dask
Reason this release was yanked:
Wrong Dask Version Pin
Project description
Dask Expressions
Dask DataFrames with query optimization.
This is a proof-of-concept rewrite of Dask DataFrame that includes query optimization and generally improved organization.
More in our blog posts:
Example
import dask_expr as dx
df = dx.datasets.timeseries()
df.head()
df.groupby("name").x.mean().compute()
Query Representation
Dask-expr encodes user code in an expression tree:
>>> df.x.mean().pprint()
Mean:
Projection: columns='x'
Timeseries: seed=1896674884
This expression tree will be optimized and modified before execution:
>>> df.x.mean().optimize().pprint()
Div:
Sum:
Fused(375f9):
| Projection: columns='x'
| Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
Count:
Fused(375f9):
| Projection: columns='x'
| Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
Stability
This project is a work in progress and will be changed without notice or deprecation warning. Please provide feedback, but it's best to avoid use in production settings.
API Coverage
Dask-Expr covers almost everything of the Dask DataFrame API. The only missing features are:
melt
- named GroupBy Aggregations
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dask-expr-0.5.0.tar.gz
.
File metadata
- Download URL: dask-expr-0.5.0.tar.gz
- Upload date:
- Size: 173.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 72bdbd8dbfd78693b8908e355466b60b2d9ec1a6bd9ae79aa7ed6645bcb6786a |
|
MD5 | 4d4b7745beae9142f0b10360043526c9 |
|
BLAKE2b-256 | 960299aff8835e12f762c0d87d1c65782c4bd58d22937ec16b4558ed802f7534 |
Provenance
File details
Details for the file dask_expr-0.5.0-py3-none-any.whl
.
File metadata
- Download URL: dask_expr-0.5.0-py3-none-any.whl
- Upload date:
- Size: 169.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8cff973a013fd89c6c7427f9e7678d8e54d813ab7647701232ab70260c2e91a7 |
|
MD5 | f7f3ae131d9a4c79c73988b5bdd40a8a |
|
BLAKE2b-256 | cac0c1ba0f718dfba694d27149153305ad2d97cc9943d1c7868963bfbe5ce614 |