a build tool for data
Project description
make for your data.
An automation tool for data manipulation.
Inspired by Open Refine.
The general principles in Databuild are:
Low entry barrier
Easy to install
Easy to grasp
Extensible
Databuild can be useful for scenarios such as:
Documenting data transformations in your infoviz project
Automate data processing in a declarative way
Installation
Install databuild:
$ pip install databuild
Quickstart
For more details, see the Extended Documentation.
$ data-build.py buildfile.json
buildfile.yaml contains a list of operations to be performed on data. Think of it as a script for a spreadsheet.
An example of build file could be:
- operation: sheets.import_data
description: Importing data from csv file
params:
sheet: dataset1
format: csv
filename: dataset1.csv
skip_last_lines: 1
- operation: columns.add_column
description: Calculate the gender ratio
params:
sheet: dataset1
name: Gender Ratio
expression:
language: python
content: "return float(row['Totale Maschi']) / float(row['Totale Femmine'])"
- operation: sheets.export_data
description: save the data
params:
sheet: dataset1
format: csv
filename: dataset2.csv
JSON buildfiles are also supported. databuild will guess the type based on the extension.
License
Licensed under BSD 3-clauses.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file databuild-0.0.10.tar.gz
.
File metadata
- Download URL: databuild-0.0.10.tar.gz
- Upload date:
- Size: 17.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fa57676cbc7c5b1310d2b0b6abc8a7877cc6e0ad1525224abfae0e57502defef |
|
MD5 | 695bf1cb87bcf8d41b8acc21fe1d5348 |
|
BLAKE2b-256 | fc4240b98091e5d629bd5f2e7beb68a6f8caddbb3af14876d9e19d666b421ae2 |