Skip to main content

No project description provided

Project description

dask-databricks

Cluster tools for running Dask on Databricks multi-node clusters.

Quickstart

To launch a Dask cluster on Databricks you need to create an init script with the following contents and configure your multi-node cluster to use it.

#!/bin/bash

# Install Dask + Dask Databricks
/databricks/python/bin/pip install --upgrade dask[complete] git+https://github.com/jacobtomlinson/dask-databricks.git@main

# Start Dask cluster components
dask databricks run

Then from your Databricks Notebook you can quickly connect a Dask Client to the scheduler running on the Spark Driver Node.

import dask_databricks

client = dask_databricks.get_client()

Now you can submit work from your notebook to the multi-node Dask cluster.

def inc(x):
    return x + 1

x = client.submit(inc, 10)
x.result()

Dashboard

You can access the Dask dashboard via the Databricks driver-node proxy. The link can be found in Client or DatabricksCluster repr or via client.dashboard_link.

>>> print(client.dashboard_link)
https://dbc-dp-xxxx.cloud.databricks.com/driver-proxy/o/xxxx/xx-xxx-xxxx/8087/status

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dask_databricks-0.2.0.tar.gz (8.5 kB view details)

Uploaded Source

Built Distribution

dask_databricks-0.2.0-py3-none-any.whl (7.3 kB view details)

Uploaded Python 3

File details

Details for the file dask_databricks-0.2.0.tar.gz.

File metadata

  • Download URL: dask_databricks-0.2.0.tar.gz
  • Upload date:
  • Size: 8.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.6

File hashes

Hashes for dask_databricks-0.2.0.tar.gz
Algorithm Hash digest
SHA256 b348182028cb01658222d477a84acd658f506948dad2b1f7275d4524840f2332
MD5 f28939ce70f8101421783b1d230dbcaa
BLAKE2b-256 113128d6995f3d48df09000106471543f5203cc0ef88dc6351a9b7e760b38a45

See more details on using hashes here.

Provenance

File details

Details for the file dask_databricks-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for dask_databricks-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c8a48a3c5b08e7fb263258f6612c1717aeded991adaac90ef763a850a3c62786
MD5 81ad2548b07aac1906ea4c4b8862fbf1
BLAKE2b-256 69f4ff94e7fcac0ce18e15677a768b31cc51719d4849d2cf0db41ae6cef767e3

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page