A command-line tool for using the Anon AI web service.
Project description
The Anon AI Toolbelt is a command line interface (CLI) tool for managing and anonymising data with the Anon AI web service. It’s developed in Python and the code is published under the MIT License at github.com/anon-ai/toolbelt.
Caution: the toolbelt is under active development. Core functionality works but commands and options are liable to change and some of the features that are documented don’t yet exist.
Installation
Install using pip into a Python3 environment:
pip install anon-ai-toolbelt
Note that the toolbelt only works with Python3 and installs dependencies including the Python Cryptography Toolkit.
Usage
The primary workflow is for a data controller to push data into the system and then for data processors to pull the data down in anonymised form.
Login
Login with your API credentials (writes to ~/.config/anon.ai/config.json):
anon login
> key: ...
> secret: ...
Push
Push a data snapshot up to ingest and store it.
anon push foo.dump mydb
When ingesting structured data you should specify the data format:
anon push foo.dump mydb --format postgres
In this example, mydb is an arbitrary resource name that you use to identify this ingested data source. Subsequent pushes to the same name are usually used to store a new snapshot of the same file or database.
The stored data is encrypted using AES-256 with a per-account encryption key that lives in (and never leaves) a secure vault. You can also optionally provide your own encryption key:
anon push foo.dump mydb --encryption-key LONG_RANDOM_STRING
Note that:
your encryption key is never persisted in our system – so you have to manage it and give it to any users that you want to share anonymised data with
there’s no strict requirement on length or format for your encryption key value (we SHA-256 hash it along with your per-account encryption key) but we recommend at least 16 bytes entropy
Pull
Pull down an anonymised copy of an ingested data snapshot:
anon pull mydb foo.dump
Optionally provide an encryption key (to decrypt the stored data with) and / or configure how you’d like it anonymised:
anon pull mydb foo.dump --config config.json --encryption-key ...
Pipe
Pipe data through to anonymise it:
anon pipe http://humanstxt.org/humans.txt /tmp/humans.anon.txt
This parses, analyses and anonymises the data on the fly, i.e.: without persisting it. The data source must currently be a URL.
Versions
You can pull specific snapshot versions by targeting them by name:
anon pull mydb --snapshot someid
You can also push snapshots up with a specific name:
anon push foo.sql mydb --snapshot someid
Tab completion
Enable bash completion by adding the following to your .bashrc:
eval "$(_ANON_COMPLETE=source anon)"
If you use zsh, you can emulate bash completion by first adding bashcompinit to your .zshrc:
autoload bashcompinit
bashcompinit
eval "$(_ANON_COMPLETE=source anon)"
For more information see Anon AI.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file anon-ai-toolbelt-0.2.10.tar.gz
.
File metadata
- Download URL: anon-ai-toolbelt-0.2.10.tar.gz
- Upload date:
- Size: 8.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ab7fedafce58182ccbb1b0f8fb2a286beefb2be407f86f420f7ddf41c2ac405a |
|
MD5 | ca7e935527c0e6a91b19bc1c3a5ea58d |
|
BLAKE2b-256 | 27d319d34745c154a3d3786e32d5af905a51dcd1ff91a089809eacc910c28542 |