Skip to main content

A generic pipeline that can be used to map raw data to RDF.

Project description

data2rdf

A pipeline for generating data representation in RDF out of raw data given in ASCII, CSV or EXCEL format.

https://data2rdf.readthedocs.io/en/latest/

Installation

Install for using the package

Either install the package from the pypi

pip install data2rdf

Install for development

Install the package from the source code:

git clone git@github.com:MI-FraunhoferIWM/data2rdf.git
cd data2rdf
pip install -e .

Windows specific

In windows it might be necessary to install curses manually. This can be done with:

pip install windows-curses

Debug a Chowlk XML File

In some cases when the syntax of the draw.io file is not correct (e.g.: missing label on arrow, bracket in class file) chowlk crashes. The only (pretty annoying but working) way to find the wrong syntax is to execute chowlk with the command line for that file and inclemently remove elements from the draw.io diagram. This way you can find the wrong syntax by process of elimination.

Version Updates

  • Chowlk is installed via pip through the dependencies
  • Running csv and excel pipeline
  • Abox pipeline CLI
  • Unnittest for csv and excel pipeline and abox pipeline

Building the docs locally

HTML

A server will start, generate the docs and listen for changes in the source files. This can be done by using docker or installing the development environment directly on the you machine. Next are installation guides for Docker and Linux OS.

Docker

First, build the Docker image by running the following command:

$ docker build -f docs.Dockerfile -t data2rdf-docs .

Then, start the program by running:

$ docker run -it --rm -v $PWD:/app -p 8000:8000 data2rdf-docs

Linux

At an OS level (these commands work on Linux Debian):

$ sudo apt install pandoc graphviz default-jre
$ sudo apt-get install texlive-latex-recommended \
                       texlive-latex-extra \
                       texlive-fonts-recommended \
                       latexmk

The python dependencies:

$ pip install .[docs]

Now you can start the server and render the docs:

$ sphinx-autobuild docs/source docs/build/html

The documentation will be available on http://127.0.0.1:8000.

PDF (LaTeX)

To generate a PDF of the documentation, simply run (from the root project folder):

make -C docs latexpdf

The generated PDF can be found under docs/build/latex/data2rdf_docs.pdf

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data2rdf-1.0.4.tar.gz (28.1 kB view details)

Uploaded Source

Built Distribution

data2rdf-1.0.4-py3-none-any.whl (33.7 kB view details)

Uploaded Python 3

File details

Details for the file data2rdf-1.0.4.tar.gz.

File metadata

  • Download URL: data2rdf-1.0.4.tar.gz
  • Upload date:
  • Size: 28.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for data2rdf-1.0.4.tar.gz
Algorithm Hash digest
SHA256 576ab755c3e12bc28a8762bb2b7d648ed40c74cdf0c7fe4644a3ce75286c1d9d
MD5 bf0e6a7d9055f17d9651bd9f1ab8223b
BLAKE2b-256 7f01f6b28ab4e0e37cbb2a2a1fdbda795ea7770d5cd0a0a4fc111c1a6972f942

See more details on using hashes here.

Provenance

File details

Details for the file data2rdf-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: data2rdf-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 33.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for data2rdf-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 2ae1eae452b5bdfeb3e04104effb41cb96a9d8cd526d664b7af65d807073515f
MD5 242500b6594e08b945700e7a0d83c19a
BLAKE2b-256 984b415aa9fe2c17f5e87a7b4fac356acc22816e1cb82b5cba1c6b4fe4f99045

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page