DataLad extension for semantic metadata handling
Project description
# DataLad extension for semantic metadata handling
[![Travis tests status](https://secure.travis-ci.org/datalad/datalad-metalad.png?branch=master)](https://travis-ci.org/datalad/datalad-metalad) [![Build status](https://ci.appveyor.com/api/projects/status/8jtp2fp3mwr5huyi/branch/master?svg=true)](https://ci.appveyor.com/project/mih/datalad-metalad) [![codecov.io](https://codecov.io/github/datalad/datalad-metalad/coverage.svg?branch=master)](https://codecov.io/github/datalad/datalad-metalad?branch=master) [![GitHub release](https://img.shields.io/github/release/datalad/datalad-metalad.svg)](https://GitHub.com/datalad/datalad-metalad/releases/) [![PyPI version fury.io](https://badge.fury.io/py/datalad-metalad.svg)](https://pypi-hypernode.com/pypi/datalad-metalad/) [![Documentation](https://readthedocs.org/projects/datalad-metalad/badge/?version=latest)](http://docs.datalad.org/projects/metalad/en/latest)
This software is a [DataLad](http://datalad.org) extension that equips DataLad with an alternative command suite for metadata handling (extraction, aggregation, reporting). It is backward-compatible with the metadata storage format in DataLad proper, while being substantially more performant (especially on large dataset hierarchies). Additionally, it provides new metadata extractors and improved variants of DataLad’s own ones that are tuned for better performance and richer, JSON-LD compliant metadata reports.
Command(s) currently provided by this extension
meta-extract – new and improved dedicated command to run any and all of DataLad’s metadata extractors.
meta-aggregate – complete reimplementation of metadata aggregation, with stellar performance benefits, in particular on large dataset hierarchies.
meta-dump – new command to specifically access the aggregated metadata present in a dataset, much faster and more predictable behavior than the metadata command in datalad-core.
Additional metadata extractor implementations
metalad_core – enriched variant of the datalad_core extractor that yields valid JSON-LD
metalad_annex – refurbished variant of the annex extractor using the metalad extractor API
metalad_custom – read pre-crafted metadata from shadow/side-care files for a dataset and/or any file in a dataset.
metalad_runprov – report provenance metadata for datalad run records following the [W3C PROV](https://www.w3.org/TR/prov-overview) model
## Installation
Before you install this package, please make sure that you [install a recent version of git-annex](https://git-annex.branchable.com/install). Afterwards, install the latest version of datalad-metalad from [PyPi](https://pypi-hypernode.com/project/datalad-metalad). It is recommended to use a dedicated [virtualenv](https://virtualenv.pypa.io):
# create and enter a new virtual environment (optional) virtualenv –system-site-packages –python=python3 ~/env/datalad . ~/env/datalad/bin/activate
# install from PyPi pip install datalad_metalad
## Support
For general information on how to use or contribute to DataLad (and this extension), please see the [DataLad website](http://datalad.org) or the [main GitHub project page](http://datalad.org). The documentation is found here: http://docs.datalad.org/projects/metalad
All bugs, concerns and enhancement requests for this software can be submitted here: https://github.com/datalad/datalad-metalad/issues
If you have a problem or would like to ask a question about how to use DataLad, please [submit a question to NeuroStars.org](https://neurostars.org/tags/datalad) with a datalad tag. NeuroStars.org is a platform similar to StackOverflow but dedicated to neuroinformatics.
All previous DataLad questions are available here: http://neurostars.org/tags/datalad/
## Acknowledgements
DataLad development is supported by a US-German collaboration in computational neuroscience (CRCNS) project “DataGit: converging catalogues, warehouses, and deployment logistics into a federated ‘data distribution’” (Halchenko/Hanke), co-funded by the US National Science Foundation (NSF 1429999) and the German Federal Ministry of Education and Research (BMBF 01GQ1411). Additional support is provided by the German federal state of Saxony-Anhalt and the European Regional Development Fund (ERDF), Project: Center for Behavioral Brain Sciences, Imaging Platform. This work is further facilitated by the ReproNim project (NIH 1P41EB019936-01A1).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file datalad_metalad-0.2.1.tar.gz
.
File metadata
- Download URL: datalad_metalad-0.2.1.tar.gz
- Upload date:
- Size: 79.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 70fe423136a168f7630b3e0ff1951e776d61e7d5f36670bddf24299ac0870285 |
|
MD5 | 921c60339bf6e95f078239668f439a1b |
|
BLAKE2b-256 | b7a44dd9b6aeeb070e390b2219cf932043d628413aa21e2ea2fd6eb3ef46202a |
File details
Details for the file datalad_metalad-0.2.1-py2.py3-none-any.whl
.
File metadata
- Download URL: datalad_metalad-0.2.1-py2.py3-none-any.whl
- Upload date:
- Size: 78.0 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.46.1 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 40b072d8fdf97ca6d820e2e4c836b762df1de340f637180ee1fdd8338f2a57c3 |
|
MD5 | f214457fb3b5d152d5b6a82d6d0b8e37 |
|
BLAKE2b-256 | 8bba2d7a77a57e6048b2d98acfd40241ca7ff376be20430c764de468784acb9a |