Pipeline tools for building and publishing analysis ready datasets
Project description
pangeo-forge
pangeo-forge is an open-source tool designed to aid the extraction, transformation, and loading of datasets. The goal of pangeo-forge is to make it easy to extract datasets from traditional data repositories and deposit them into cloud object storage in analysis-ready, cloud-optimized format.
pangeo-forge is inspired by conda-forge, a community-led collection of recipes for building Conda packages. We hope that pangeo-forge can play the same role for datasets.
Documentation
More can be learned about pangeo-forge, its progress, and related subprojects in its official documentation.
Contributing
pangeo-forge is still early in development - there are several ways to contribute:
- Create a recipe for a dataset you are interested in
- Open an issue or pull request here or in any of the related subprojects (pangeo-smithy, staged-recipes)
- Check out the project roadmap
Get in touch
Discussions on pangeo-forge are generally hosted biweekly on Mondays at 7pm UTC via Whereby. More details on the scheduling of these meetings can be found here.
License
This project is licensed under the Apache License, Version 2.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pangeo_forge-0.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6215386f234e33daee65197445a4158e4270b811c3a888325ef789982ba3467d |
|
MD5 | 97aae5f2a92e8928bb32af8bbd1721fb |
|
BLAKE2b-256 | 3a6adbc49c6fe842026f530e123ab96d9c9633819f58f6fc125993f165f2d3d4 |