High Level Expressions for Dask

These details have been verified by PyPI

Maintainers

fjetter jrbourbeau phofl rjzamora

These details have not been verified by PyPI

Project links

Source code

Project description

Dask Expressions

Dask DataFrames with query optimization.

This is a proof-of-concept rewrite of Dask DataFrame that includes query optimization and generally improved organization.

Example

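import dask_expr as dx

# Build an example DataFrame from the bundled timeseries dataset
df = dx.datasets.timeseries()
df.head()

# Group by the "name" column and compute the mean of column "x"
df.groupby("name").x.mean().compute()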

Query Representation

Dask-expr encodes user code in an expression tree:

>>> df.x.mean().pprint()

Mean:
  Projection: columns='x'
    Timeseries: seed=1896674884

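This expression tree is optimized and rewritten before execution. In the output below, the mean is rewritten as a sum divided by a count, and both branches read only column x from the same fused block: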

>>> df.x.mean().optimize().pprint()

Div:
  Sum:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
  Count:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
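
To make the idea of an expression tree and a rewrite step concrete, here is a minimal pure-Python sketch. It is not dask-expr's implementation: the classes, the tree_repr method, and the optimize function are hypothetical names chosen for illustration, and the sketch models only the Mean to Sum / Count rewrite, not fusion or column projection into the data source.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Expr:
    """Base node of a toy expression tree (hypothetical, not dask-expr's class)."""

    def children(self):
        # Child nodes are the fields that are themselves expressions.
        return [v for v in vars(self).values() if isinstance(v, Expr)]

    def params(self):
        # Parameters are the remaining, non-expression fields.
        return {k: v for k, v in vars(self).items() if not isinstance(v, Expr)}

    def tree_repr(self, indent=0):
        # Render "Name: key=value" and indent each child by two spaces.
        params = " ".join(f"{k}={v!r}" for k, v in self.params().items())
        lines = [" " * indent + f"{type(self).__name__}: {params}".rstrip()]
        for child in self.children():
            lines.append(child.tree_repr(indent + 2))
        return "\n".join(lines)


@dataclass(frozen=True)
class Timeseries(Expr):
    seed: int


@dataclass(frozen=True)
class Projection(Expr):
    frame: Expr
    columns: str


@dataclass(frozen=True)
class Mean(Expr):
    frame: Expr


@dataclass(frozen=True)
class Sum(Expr):
    frame: Expr


@dataclass(frozen=True)
class Count(Expr):
    frame: Expr


@dataclass(frozen=True)
class Div(Expr):
    left: Expr
    right: Expr


def optimize(expr):
    """Rewrite every Mean(x) as Div(Sum(x), Count(x)), children first."""
    updates = {k: optimize(v) for k, v in vars(expr).items() if isinstance(v, Expr)}
    expr = replace(expr, **updates)
    if isinstance(expr, Mean):
        # Both branches reuse the same child object.
        return Div(Sum(expr.frame), Count(expr.frame))
    return expr


tree = Mean(Projection(Timeseries(seed=1896674884), columns="x"))
print(tree.tree_repr())

optimized = optimize(tree)
print(optimized.tree_repr())
```

Because the nodes are immutable, a rewrite returns a new tree and leaves the user's original expression untouched, which is one plausible reason to separate the tree a user builds from the tree that is executed.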

Stability

This project is a work in progress and may change without notice or deprecation warnings. Feedback is welcome, but it is best to avoid use in production settings.

API Coverage

Dask-expr covers almost all of the Dask DataFrame API. The only missing features are:

melt
named GroupBy Aggregations

Release history

1.1.17

Nov 8, 2024

1.1.16

Oct 17, 2024

1.1.15

Sep 28, 2024

1.1.14

Sep 13, 2024

1.1.13

Sep 2, 2024

1.1.12

Aug 30, 2024

1.1.11

Aug 16, 2024

1.1.10

Aug 6, 2024

1.1.9

Jul 20, 2024

1.1.8

Jul 19, 2024

1.1.7

Jul 5, 2024

1.1.6

Jun 21, 2024

1.1.5

Jun 20, 2024

1.1.4

Jun 19, 2024

1.1.3

Jun 14, 2024

1.1.2

May 31, 2024

1.1.1

May 17, 2024

1.1.0

May 3, 2024

1.0.14

Apr 30, 2024

1.0.13

Apr 25, 2024

1.0.12

Apr 19, 2024

1.0.11

Apr 9, 2024

1.0.10

Apr 4, 2024

1.0.9

Apr 2, 2024

1.0.7

Apr 2, 2024

1.0.6

Apr 1, 2024

1.0.5

Mar 22, 2024

1.0.4

Mar 18, 2024

1.0.3

Mar 15, 2024

1.0.2

Mar 14, 2024

1.0.1

Mar 12, 2024

1.0

Mar 12, 2024

0.5.3

Feb 28, 2024

0.5.2

Feb 26, 2024

0.5.1

Feb 23, 2024

0.5.0 yanked

Feb 23, 2024

Reason this release was yanked:

Wrong Dask Version Pin

0.4.2

Feb 12, 2024

0.4.1

Feb 10, 2024

0.4.0 (this version)

Feb 1, 2024

0.3.5

Jan 18, 2024

0.3.4

Jan 12, 2024

0.3.3

Jan 10, 2024

0.3.2

Jan 5, 2024

0.3.1

Dec 19, 2023

0.3.0

Dec 15, 2023

0.2.9

Dec 12, 2023

0.2.8

Dec 8, 2023

0.2.7

Dec 5, 2023

0.2.6

Dec 1, 2023

0.2.5

Nov 29, 2023

0.2.4

Nov 28, 2023

0.2.3

Nov 22, 2023

0.2.2

Nov 21, 2023

0.2.1

Nov 20, 2023

0.2.0

Nov 20, 2023

0.1.12

Nov 2, 2023

0.1.11

Oct 20, 2023

0.1.10

Oct 17, 2023

0.1.9

Oct 12, 2023

0.1.8

Oct 4, 2023

0.1.7

Sep 25, 2023

0.1.6

Sep 20, 2023

0.1.5

Aug 18, 2023

0.1.4

Aug 12, 2023

0.1.3

Aug 4, 2023

0.1.2

Jul 28, 2023

0.1.1

Jul 21, 2023

0.1.0

Jul 12, 2023

Download files

Source Distribution

dask-expr-0.4.0.tar.gz (147.2 kB)

Uploaded Feb 1, 2024 Source

Built Distribution

dask_expr-0.4.0-py3-none-any.whl (161.7 kB)

Uploaded Feb 1, 2024 Python 3

Hashes for dask-expr-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`ee86ac5a5d3a892341af7ffab58e3a579c12aacbe332f2fe7477f668ac260279`
MD5	`6d1df4358c09b806758ad582fa4db5a2`
BLAKE2b-256	`00ec7faf579264a8889afd26d6e9b0617ab88f6cd87dff9244c469eb6e08e71f`

Hashes for dask_expr-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a2e37fa0fa52afee7ee4822062103bd30820eeedb80771dc7acbeaa6fa2cb92f`
MD5	`e9d2a0506e13f5e900a69e32283b060f`
BLAKE2b-256	`f06fae02288f42407125242db829003b237af160e152738c0b3f09387decf7eb`
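
The SHA256 digests above can be used to check a downloaded file before installing it. The following is a minimal sketch using only the Python standard library; the helper names sha256_of and verify are hypothetical, and the sketch assumes the file keeps its published name on disk.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the tables above, keyed by file name.
EXPECTED_SHA256 = {
    "dask-expr-0.4.0.tar.gz": (
        "ee86ac5a5d3a892341af7ffab58e3a579c12aacbe332f2fe7477f668ac260279"
    ),
    "dask_expr-0.4.0-py3-none-any.whl": (
        "a2e37fa0fa52afee7ee4822062103bd30820eeedb80771dc7acbeaa6fa2cb92f"
    ),
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file matches the published digest for its name."""
    expected = EXPECTED_SHA256[Path(path).name]
    return sha256_of(path) == expected
```

For example, verify("dask-expr-0.4.0.tar.gz") returns True only when the local file's digest equals the published one, and raises KeyError for a file name that is not listed above.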