
Kedro-Airflow makes it easy to deploy Kedro projects to Airflow

Kedro-Airflow

Apache Airflow is a tool for orchestrating complex workflows and data processing pipelines. The Kedro-Airflow plugin can be used for:

  • Rapid pipeline creation in the prototyping phase. You can write Python functions in Kedro without worrying about schedulers, daemons, services or having to recreate the Airflow DAG file.
  • Automatic dependency resolution in Kedro. Kedro works out the execution order from each node's inputs and outputs, so you do not have to specify the order of your tasks by hand in Airflow.
  • Distributing Kedro tasks across many workers. You can also monitor and schedule task runs.
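
To make the dependency-resolution point concrete, here is a minimal sketch of the general idea: ordering tasks from their declared inputs and outputs with a topological sort. It is not Kedro's actual implementation. The node and dataset names are hypothetical, and it assumes Python 3.9 or later for `graphlib`.

```python
from graphlib import TopologicalSorter

# Hypothetical nodes: name -> (input dataset names, output dataset names).
# They are deliberately listed out of order.
NODES = {
    "train_model": ({"features"}, {"model"}),
    "clean_data": ({"raw"}, {"cleaned"}),
    "build_features": ({"cleaned"}, {"features"}),
}


def resolve_order(nodes):
    """Return node names in an order that respects dataset dependencies."""
    # Map each dataset to the node that produces it.
    producers = {}
    for name, (_inputs, outputs) in nodes.items():
        for dataset in outputs:
            producers[dataset] = name

    # A node depends on whichever nodes produce its inputs.
    # Inputs with no producer (such as "raw") are external data.
    graph = {
        name: {producers[d] for d in inputs if d in producers}
        for name, (inputs, _outputs) in nodes.items()
    }
    return list(TopologicalSorter(graph).static_order())


order = resolve_order(NODES)
print(order)
```

Because the three nodes form a single chain, only one valid order exists: `clean_data`, then `build_features`, then `train_model`.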

Installation

kedro-airflow is a Python plugin. To install it:

pip install kedro-airflow

Usage

You can use kedro-airflow to deploy a Kedro pipeline as an Airflow DAG by following these steps:

Step 1: Generate the DAG file

At the root directory of the Kedro project, run:

kedro airflow create

This command generates an Airflow DAG file in the airflow_dags/ directory of your project. Pass the --pipeline flag to generate the DAG file for a specific Kedro pipeline, and the --env flag to generate it for a specific Kedro environment.
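
If you want to script this step, the following sketch builds the command line with the two flags described above and runs it from the project root. The helper names, and the pipeline and environment names used in the usage note, are hypothetical. Only the `kedro airflow create` command and its `--pipeline` and `--env` flags come from this README.

```python
import subprocess


def build_create_command(pipeline=None, env=None):
    """Build the argument list for `kedro airflow create`."""
    command = ["kedro", "airflow", "create"]
    if pipeline is not None:
        command += ["--pipeline", pipeline]
    if env is not None:
        command += ["--env", env]
    return command


def create_dag(project_root, pipeline=None, env=None):
    """Run the command from the Kedro project root; raises if it fails."""
    subprocess.run(
        build_create_command(pipeline, env),
        cwd=project_root,
        check=True,
    )
```

For example, `create_dag(".", pipeline="data_science", env="staging")` would run `kedro airflow create --pipeline data_science --env staging` in the current directory, assuming a pipeline and an environment with those names exist in your project.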

Step 2: Copy the DAG file to the Airflow DAGs folder

For more information about the DAGs folder, see the Airflow documentation.
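
As a rough sketch of this step, the helper below copies every generated `.py` file from the project's airflow_dags/ directory into a DAGs folder that you supply. The function name is hypothetical, and the location of your DAGs folder depends on your Airflow configuration (the `dags_folder` setting, which often points at `~/airflow/dags`), so treat the path you pass in as an assumption to verify.

```python
import shutil
from pathlib import Path


def copy_dag_files(project_root, dags_folder):
    """Copy generated DAG files into the Airflow DAGs folder."""
    source_dir = Path(project_root) / "airflow_dags"
    target_dir = Path(dags_folder)
    # Create the DAGs folder if it does not exist yet.
    target_dir.mkdir(parents=True, exist_ok=True)

    copied = []
    for dag_file in sorted(source_dir.glob("*.py")):
        # copy2 preserves file metadata and returns the destination path.
        copied.append(Path(shutil.copy2(dag_file, target_dir)))
    return copied
```

A call such as `copy_dag_files(".", Path.home() / "airflow" / "dags")` returns the list of copied files, which is handy for logging what was deployed.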

Step 3: Package and install the Kedro pipeline in the Airflow executor's environment

After generating and deploying the DAG file, package the Kedro pipeline and install it into the Airflow executor's environment. See the Kedro guide on deploying a project as a Python package for more details.
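
One quick way to check that the install worked is to confirm the package can be found by the interpreter that the Airflow executor uses. The sketch below does this with the standard library. The function name and the package name in the usage note are hypothetical, and it only checks top-level package names.

```python
from importlib.util import find_spec


def is_installed(package_name):
    """Return True if a top-level package is importable in this environment."""
    return find_spec(package_name) is not None
```

Run with the executor's Python, `is_installed("my_kedro_project")` should return `True` once the packaged project has been installed there, where `my_kedro_project` stands in for your own package name.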

FAQ

What if my DAG file is in a different directory to my project folder?

By default, the generated DAG file is configured on the assumption that it lives in the same directory as your project, as set out in the DAG template. If your DAG file is located in a different directory, you will need to edit the generated file manually after running the kedro airflow create command.
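
What exactly needs editing depends on the template version, so check your generated file for the setting that holds the project location. As one possible approach, the sketch below reads the project path from an environment variable and falls back to the current working directory. Both the helper and the `KEDRO_PROJECT_PATH` variable name are hypothetical and are not part of kedro-airflow.

```python
import os
from pathlib import Path


def resolve_project_path(env_var="KEDRO_PROJECT_PATH"):
    """Return the project path from an environment variable, else the cwd."""
    configured = os.environ.get(env_var)
    return Path(configured).resolve() if configured else Path.cwd()
```

You could then use the returned path wherever the generated DAG file refers to the project location, which keeps the path out of the DAG file itself.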
