Skip to main content

A generic pipeline that can be used to map raw data to RDF.

Project description

data2rdf

A pipeline for generating data representation in RDF out of raw data given in ASCII, CSV, JSON or EXCEL format.

https://data2rdf.readthedocs.io/en/latest/

Coverage

Coverage Report
FileStmtsMissCoverMissing
data2rdf
   init.py50100% 
   config.py180100% 
   utils.py3366 82%
   warnings.py20100% 
data2rdf/models
   init.py30100% 
   base.py4744 91%
   graph.py1222626 79%
   mapping.py300100% 
data2rdf/modes
   init.py40100% 
data2rdf/parsers
   init.py60100% 
   base.py1341111 92%
   csv.py1651919 88%
   excel.py1591313 92%
   json.py1432020 86%
   utils.py6788 88%
data2rdf/pipelines
   init.py20100% 
   main.py8299 89%
data2rdf/qudt
   init.py00100% 
   utils.py421212 71%
TOTAL106412888% 

Installation

Install for using the package

Either install the package from the pypi

pip install data2rdf

Install for development

Install the package from the source code:

git clone git@github.com:MI-FraunhoferIWM/data2rdf.git
cd data2rdf
pip install -e .

Unit tests

Before running the unit tests, please install the needed packages:

pip install data2rdf[tests]

Afterwards, run the unittest with:

pytest

Building the docs locally

HTML

A server will start, generate the docs and listen for changes in the source files. This can be done by using docker or installing the development environment directly on the you machine. Next are installation guides for Docker and Linux OS.

Docker

First, build the Docker image by running the following command:

$ docker build -f docs.Dockerfile -t data2rdf-docs .

Then, start the program by running:

$ docker run -it --rm -v $PWD:/app -p 8000:8000 data2rdf-docs

Linux

At an OS level (these commands work on Linux Debian):

$ sudo apt install pandoc graphviz default-jre
$ sudo apt-get install texlive-latex-recommended \
                       texlive-latex-extra \
                       texlive-fonts-recommended \
                       latexmk

The python dependencies:

$ pip install .[docs]

Now you can start the server and render the docs:

$ sphinx-autobuild docs/source docs/build/html

The documentation will be available on http://127.0.0.1:8000.

PDF (LaTeX)

To generate a PDF of the documentation, simply run (from the root project folder):

make -C docs latexpdf

The generated PDF can be found under docs/build/latex/data2rdf_docs.pdf

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data2rdf-2.0.2.tar.gz (33.2 kB view details)

Uploaded Source

Built Distribution

data2rdf-2.0.2-py3-none-any.whl (56.7 kB view details)

Uploaded Python 3

File details

Details for the file data2rdf-2.0.2.tar.gz.

File metadata

  • Download URL: data2rdf-2.0.2.tar.gz
  • Upload date:
  • Size: 33.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for data2rdf-2.0.2.tar.gz
Algorithm Hash digest
SHA256 2749bd5effbefbea7f4caa17dd84a345ce933c9977da2ecc5c9e03d851728c1d
MD5 c4decad150647670c00d0589bb1edb50
BLAKE2b-256 eac1b7c4c2a6addf04ff65c1781f0026e1ccf47147145607a09c917667e935d5

See more details on using hashes here.

Provenance

File details

Details for the file data2rdf-2.0.2-py3-none-any.whl.

File metadata

  • Download URL: data2rdf-2.0.2-py3-none-any.whl
  • Upload date:
  • Size: 56.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for data2rdf-2.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e4c57e0e9bfd6a41495a7f1c00423fbc24eb83d1827804f29f6842d98f659c6b
MD5 5e5d64a5b2ca786dcccb6495691e5771
BLAKE2b-256 a0bbc600509c88b18ca811553ca0e37aa9919384e7db7687c2f5f2a0ef41516a

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page