A Python package that provides helpers for cleaning, deduplication, enrichment, etc. in Spark
Project description
Spark ETL Python
A Python package that provides helpers for cleaning, deduplication, enrichment, etc. in Spark
Free software: MIT license
Documentation: https://spark-etl-python.readthedocs.io.
Features
TODO
Develop
In order to be able to develop on this package:
Create a virtual environment
Install pip-tools: pip install pip-tools
Run pip-sync requirements_dev.txt requirements.txt
To update dependencies, add them to requirements.in (if they are needed to run the package) or requirements_dev.in. Then run pip-compile requirements.in or pip-compile requirements_dev.in.
Credits
This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.
History
0.1.0 (2018-10-19)
First release on PyPI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for spark_etl_python-0.1.5-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 88a12e33433993fdfbb97ed5caa72b838cee000b2e9f48012e524bedc1d00454 |
|
MD5 | 3da6e63d45636b2f2cb8096c8d9e05a4 |
|
BLAKE2b-256 | 35fb877892a5ba2f3d0f2f4b700f3284ddbcaac9fae0ce9f7995b584e30f90a8 |