24 projects
ibis-framework
The portable Python dataframe library
dask
Parallel PyData with Task Scheduling
gcsfs
Convenient Filesystem interface over GCS
crick
High performance approximate and streaming algorithms
conda-pack
Package conda environments for redistribution
partd
Appendable key-value storage
msgspec
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML.
dask-gateway
A client library for interacting with a dask-gateway server
dask-gateway-server
A multi-tenant server for securely deploying and managing multiple Dask clusters.
ibis-datasette
An ibis backend for querying datasette
skein
A simple tool and library for deploying applications on Apache YARN
dask-yarn
Deploy dask clusters on Apache YARN
quickle
A quicker pickle
ery
cachey
Caching mindful of computation/storage costs
hdfs3
Python wrappers for libhdfs3, a native HDFS client
jupyter-hdfscm
A Jupyter ContentsManager for HDFS
jupyterhub-kerberosauthenticator
A JupyterHub authenticator using Kerberos
jupyterhub-yarnspawner
JupyterHub Spawner for Apache Hadoop/YARN Clusters
hadoop-test-cluster
A CLI for managing hadoop clusters for testing
venv-pack
Package virtual environments for redistribution
knit
Python wrapper for YARN Applications
dask-searchcv
Tools for doing hyperparameter search with Scikit-Learn and Dask
ptime
IPython magic for parallel profiling