Skip to main content

Wrapper for Great Expectations to fit the requirements of the Gemeente Amsterdam.

Project description

Introduction

DISCLAIMER: Repo is in PoC phase DISCLAIMER: The functions can run on Databricks using a Personal Compute Cluster

This repository contains functions that will ease the use of Great Expectations. Users can input data and data quality rules and get rules in return.

Getting Started

Prerequisites:

Run the following code in your workspace:

pip install great_expectations

When working in Databricks you can clone this repo to Databricks Repos. Then you can access it in your workspace using:

import sys sys.path.append("/Workspace/Repos/{user}/{repo_name}") from {file} import {function}

Parameter examples: user: j.cruijff@amsterdam.nl repo_name: dq_repo file: df_checker function: df_check

Updates

version = "0.1.0" : dq_rules_example.json is updated. Added: "dataframe_parameters": { "unique_identifier": "id" }

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dq-suite-amsterdam-0.1.0.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

dq_suite_amsterdam-0.1.0-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file dq-suite-amsterdam-0.1.0.tar.gz.

File metadata

  • Download URL: dq-suite-amsterdam-0.1.0.tar.gz
  • Upload date:
  • Size: 4.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for dq-suite-amsterdam-0.1.0.tar.gz
Algorithm Hash digest
SHA256 53e3b02d7e9bda954af4b16cfb2e6210e421a3111767e254727a9b6ea7d002f1
MD5 33e13be51935656d1266a1400166bba1
BLAKE2b-256 e4aa41d26233bb871e7fe2736c93df8d081138668f4eafcfd56aaacfc38c2c68

See more details on using hashes here.

File details

Details for the file dq_suite_amsterdam-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for dq_suite_amsterdam-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 26441c684c24a9192eec13a0458b93f0c1bca0b2e737183bdc8f9c4366c3aa16
MD5 f2f5a4cd4436b648aef34c2b0a5514e4
BLAKE2b-256 0bf899769b69e644ffde48a22dfc257f5c53614223e3596d9f9fc08dd92206cb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page