packtivity - general purpose schema + bindings for PROV activities
packtivity
This package aims to collect implementations of both synchronous and asynchronous execution of preserved, but parametrized scientific computational tasks that come with batteries included, i.e. with a full specification of their software dependencies. In that sense they are packaged activities -- packtivities.
This package provides tools to validate and execute data processing tasks that are written according to the "packtivity" JSON schemas defined in https://github.com/diana-hep/yadage-schemas.
Packtivities define
- the software environment,
- parametrized process descriptions (which programs to run within this environment), and
- human- and machine-readable outputs (as JSON) describing the resulting data fragments.
At run-time they are paired with a concrete set of parameters, supplied as JSON documents, and an external storage/state to actually execute these tasks.
Packtivity in Yadage
This package is used by https://github.com/lukasheinrich/yadage to execute the individual steps of yadage workflows.
Example Packtivity spec
This packtivity spec is part of a number of yadage workflows. It runs the Delphes detector simulation on a HepMC file and outputs events in the LHCO and ROOT file formats. The spec is stored in a public location (https://github.com/lukasheinrich/yadage-workflows/blob/master/phenochain/delphes.yml), from which it can later be retrieved:
process:
  process_type: 'string-interpolated-cmd'
  cmd: 'DelphesHepMC {delphes_card} {outputroot} {inputhepmc} && root2lhco {outputroot} {outputlhco}'
publisher:
  publisher_type: 'frompar-pub'
  outputmap:
    lhcofile: outputlhco
    rootfile: outputroot
environment:
  environment_type: 'docker-encapsulated'
  image: lukasheinrich/root-delphes
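To make the roles of the process and publisher sections more concrete, here is a minimal Python sketch of what a string-interpolated command and a parameter-based publisher amount to. It is an illustration only and not packtivity's own code: the helper names render_command and publish_from_parameters and the file paths are hypothetical, and it assumes that the outputmap maps each output name to the name of a parameter whose value is published.

```python
# Illustrative sketch only; not packtivity's implementation.
# The spec below mirrors the process and publisher sections of the example.
spec = {
    "process": {
        "process_type": "string-interpolated-cmd",
        "cmd": "DelphesHepMC {delphes_card} {outputroot} {inputhepmc} "
               "&& root2lhco {outputroot} {outputlhco}",
    },
    "publisher": {
        "publisher_type": "frompar-pub",
        "outputmap": {"lhcofile": "outputlhco", "rootfile": "outputroot"},
    },
}

# Hypothetical parameter values standing in for the JSON parameter document.
parameters = {
    "delphes_card": "delphes/cards/delphes_card_ATLAS.tcl",
    "inputhepmc": "/data/pythia/output.hepmc",
    "outputroot": "/data/outdir/output.root",
    "outputlhco": "/data/outdir/output.lhco",
}


def render_command(spec, parameters):
    """Fill the {placeholders} of the command template with parameter values."""
    return spec["process"]["cmd"].format(**parameters)


def publish_from_parameters(spec, parameters):
    """Map each output name to the value of the parameter it refers to
    (assumed behaviour of a parameter-based publisher)."""
    outputmap = spec["publisher"]["outputmap"]
    return {output: parameters[par] for output, par in outputmap.items()}


command = render_command(spec, parameters)
outputs = publish_from_parameters(spec, parameters)
print(command)
print(outputs)
```

In this reading, the parameters drive both the command that is executed and the description of its outputs, while the environment section only selects where the command runs.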
Usage
You can run the packtivity synchronously by specifying the spec (which can point to GitHub) and all necessary parameters, and by attaching an external state via the --read and --write flags:
packtivity-run -t from-github/phenochain delphes.yml \
  -p inputhepmc="$PWD/pythia/output.hepmc" \
  -p outputroot="'{workdir}/output.root'" \
  -p outputlhco="'{workdir}/output.lhco'" \
  -p delphes_card=delphes/cards/delphes_card_ATLAS.tcl \
  --read pythia --write outdir
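When the same command has to be launched from a script, it can help to assemble the argument list programmatically. The following is a small sketch that uses only the flags shown above (-t, -p, --read, --write). The helper build_packtivity_run_args is hypothetical and not part of packtivity, and the absolute input path is a placeholder.

```python
import shlex


def build_packtivity_run_args(toplevel, spec, parameters, read, write):
    """Assemble an argument list equivalent to the shell command above."""
    args = ["packtivity-run", "-t", toplevel, spec]
    for name, value in parameters.items():
        # Each parameter is passed as its own "-p name=value" pair.
        args += ["-p", f"{name}={value}"]
    args += ["--read", read, "--write", write]
    return args


args = build_packtivity_run_args(
    toplevel="from-github/phenochain",
    spec="delphes.yml",
    parameters={
        "inputhepmc": "/data/pythia/output.hepmc",  # placeholder path
        # The inner single quotes are kept exactly as in the shell example.
        "outputroot": "'{workdir}/output.root'",
        "outputlhco": "'{workdir}/output.lhco'",
        "delphes_card": "delphes/cards/delphes_card_ATLAS.tcl",
    },
    read="pythia",
    write="outdir",
)

# Print a shell-quoted version for inspection.
print(shlex.join(args))
```

With packtivity installed, the list could then be handed to subprocess.run(args, check=True). Passing a list instead of a single string avoids a second round of shell quoting.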
Asynchronous Backends
To facilitate the use of distributed resources, a number of asynchronous backends can be specified. Here is an example for IPython Parallel clusters:
packtivity-run -b ipcluster --asyncwait \
  -t from-github/phenochain delphes.yml \
  -p inputhepmc="$PWD/pythia/output.hepmc" \
  -p outputroot="'{workdir}/output.root'" \
  -p outputlhco="'{workdir}/output.lhco'" \
  -p delphes_card=delphes/cards/delphes_card_ATLAS.tcl \
  --read pythia --write outdir
You can replace the --asyncwait flag with --async to get a JSONable proxy representation with which to check on the job status later. By default the proxy information is written to proxy.json (customizable via the -x flag):
packtivity-run -b celery --async \
  -t from-github/phenochain delphes.yml \
  -p inputhepmc="$PWD/pythia/output.hepmc" \
  -p outputroot="'{workdir}/output.root'" \
  -p outputlhco="'{workdir}/output.lhco'" \
  -p delphes_card=delphes/cards/delphes_card_ATLAS.tcl \
  --read pythia --write outdir
At a later point in time you can check the job status via:
packtivity-checkproxy proxy.json
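The submit-now, check-later pattern behind the proxy file can be sketched in a few lines of Python. Everything in this sketch is hypothetical: the JobProxy class, its fields, the JOB_STATES table, and the status strings are stand-ins chosen for illustration and say nothing about the actual contents of proxy.json or the output of packtivity-checkproxy.

```python
import json
import os
import tempfile
from dataclasses import asdict, dataclass


@dataclass
class JobProxy:
    """Hypothetical JSON-serializable handle for a submitted job."""
    backend: str
    job_id: str


# Hypothetical stand-in for the job state a real backend would track.
JOB_STATES = {"job-1": "RUNNING"}


def save_proxy(proxy, path):
    """Write the proxy to disk so a later process can pick it up."""
    with open(path, "w") as f:
        json.dump(asdict(proxy), f)


def load_proxy(path):
    """Rebuild the proxy from its JSON representation."""
    with open(path) as f:
        return JobProxy(**json.load(f))


def check(proxy):
    """Look up the current state of the job the proxy refers to."""
    return JOB_STATES[proxy.job_id]


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "proxy.json")
    save_proxy(JobProxy(backend="celery", job_id="job-1"), path)

    restored = load_proxy(path)        # e.g. in a later session
    first_status = check(restored)
    JOB_STATES["job-1"] = "SUCCESS"    # simulate the job finishing
    later_status = check(restored)

print(first_status, later_status)
```

The point of the design is that the proxy carries only enough information to find the job again, so the submitting process does not have to stay alive.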
External Backends
Users can implement their own backends to handle the JSON documents describing the packtivities. A custom backend is enabled by using the fromenv backend and setting an environment variable that specifies the module holding the backend and proxy classes. The format of the environment variable is module:backendclass:proxyclass, e.g.:
export PACKTIVITY_ASYNCBACKEND="externalbackend:ExternalBackend:ExternalProxy"
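As a rough illustration of the module:backendclass:proxyclass format, the sketch below splits such a string and imports the two named classes with importlib. The function resolve_backend_spec is hypothetical and only shows how a value of this shape could be resolved. It is not packtivity's loader, and the standard-library classes in the demo merely stand in for a real backend and proxy class.

```python
import importlib


def resolve_backend_spec(spec):
    """Split 'module:backendclass:proxyclass' and import both classes."""
    parts = spec.split(":")
    if len(parts) != 3:
        raise ValueError(
            f"expected module:backendclass:proxyclass, got {spec!r}"
        )
    module_name, backend_name, proxy_name = parts
    module = importlib.import_module(module_name)
    return getattr(module, backend_name), getattr(module, proxy_name)


# Demo with a standard-library module standing in for "externalbackend".
# In practice the string would come from
# os.environ["PACKTIVITY_ASYNCBACKEND"].
backend_cls, proxy_cls = resolve_backend_spec("collections:OrderedDict:Counter")
print(backend_cls.__name__, proxy_cls.__name__)
```

For the import to succeed, the module named in the variable has to be importable in the environment where packtivity-run is started, for example by being installed or located on PYTHONPATH.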
File details
Details for the file packtivity-0.15.0.tar.gz.
File metadata
- Download URL: packtivity-0.15.0.tar.gz
- Upload date:
- Size: 34.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | 170e89b2d8868f411b5ee20ccc5842decd859a6d4a25149ff3f69c09309dafe3
MD5 | bcd46057db63a36c936a21d6c32df2ae
BLAKE2b-256 | 20bb3f7b9406f81288eea20e00214416f616ea766d3861a331b46d4b2d86dae1
File details
Details for the file packtivity-0.15.0-py3-none-any.whl.
File metadata
- Download URL: packtivity-0.15.0-py3-none-any.whl
- Upload date:
- Size: 36.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.2
File hashes
Algorithm | Hash digest
---|---
SHA256 | 4c6d3bdc3e55d86abd8ee7ae14f386fbae655ffd42e54468f01b7daf853f84f5
MD5 | 4653606d338e88f42d87854b4f89ae14
BLAKE2b-256 | 86eb92d6466c2da0d3480f9772f140745a582b65724d73adec06c91895fb8f21