spinta

A platform for describing, extracting, transforming, loading and serving open data.

These details have not been verified by PyPI

Project links

Project description

Spinta is a command line tool and REST JSON API service for publishing and mapping data between different physical data models, JSON API and semantic data models. It supports a great deal of data schemes and formats.

https://gitlab.com/atviriduomenys/spinta/badges/master/pipeline.svg

https://gitlab.com/atviriduomenys/spinta/badges/master/coverage.svg

Physical data        Different       Real time         REST JSON
   sources            formats      transformation         API

                      +-----+
                 +--> | SQL | --------->|
   +------+      |    +-----+           |
   | file | ---->|                      |
   +------+      |    +-----+           |
                 +--> | CSV | --------->|
  +--------+     |    +-----+           |         +---------------+
  | DB API | --->|                      | ------->| REST JSON API |
  +--------+     |    +------+          |         +---------------+
                 |    | JSON | -------->|
 +----------+    |    +------+          |
 | REST API | -->|                      |
 +----------+    |    +-----+           |
                 +--> | XML | --------->|
                      +-----+

Purpose

Describe your data: You can automatically generate data structure description table (Manifest) from many different data sources.
Extract your data: Once you have your data structure in Manifest tables, you can extract data from multiple external data sources. Extracted data are validated and transformed using rules defined in Manifest table. Finally, data can be stored into internal database in order to provide fast and flexible access to data.
Transform your data: Data transformations are applied in real time, when reading data from source. This puts some limitations on transformation side, but allows data to be streamed in real time.
Publish your data: Once you have your data loaded into internal database, you can publish data using API. API is generated automatically using Manifest tables and provides extracted data in many different formats. For example if original data source was a CSV file, now you have a flexible API, that can talk JSON, RDF, SQL, CSV and other formats.

Features

Simple 15 column table format for describing data structures (you can use any spreadsheet software to manage metadata of your data)
Internal data storage with pluggable backends (PostgreSQL or Mongo)
Build-in async API server built on top of Starlette for data publishing
Simple web based data browser.
Convenient command-line interface
Public or restricted API access via OAuth protocol using build-in access management.
Simple DSL for querying, transforming and validating data.
Low memory consumption for data of any size
Support for many different data sources
Advanced data extraction even from dynamic API.
Compatible with DCAT and Frictionless Data Specifications.

Example

If you have an SQLite database:

$ sqlite3 sqlite.db <<EOF
CREATE TABLE COUNTRY (
    NAME TEXT
);
EOF

You can get a limited API and simple web based data browser with a single command:

$ spinta run -r sql sqlite:///sqlite.db

Then you can generate metadata table (manifest) like this:

$ spinta inspect -r sql sqlite:///sqlite.db
d | r | b | m | property | type   | ref | source              | prepare | level | access | uri | title | description
dataset                  |        |     |                     |         |       |        |     |       |
  | sql                  | sql    |     | sqlite:///sqlite.db |         |       |        |     |       |
                         |        |     |                     |         |       |        |     |       |
  |   |   | Country      |        |     | COUNTRY             |         |       |        |     |       |
  |   |   |   | name     | string |     | NAME                |         | 3     | open   |     |       |

Generated data structure table can be saved into a CSV file:

$ spinta inspect -r sql sqlite:///sqlite.db -o manifest.csv

Missing peaces in metadata can be filled using any Spreadsheet software.

Once you done editing metadata, you can test it via web based data browser or API:

$ spinta run --mode external manifest.csv

Once you are satisfied with metadata, you can generate a new metadata table for publishing, removing all traces of original data source:

$ spinta copy --no-source --access open manifest.csv manifest-public.csv

Now you have matadata for publishing, but all things about original data source are gone. In order to publish data, you need to copy external data to internal data store. To do that, first you need to initialize internal data store:

$ spinta config add backend my_backend postgresql postgresql://localhost/db
$ spinta config add manifest my_manifest tabular manifest-public.csv
$ spinta migrate

Once internal database is initialized, you can push external data into it:

$ spinta push --access open manifest.csv

And now you can publish data via full featured API with a web based data browser:

$ spinta run

You can access your data like this:

$ http :8000/dataset/sql/Country
HTTP/1.1 200 OK
content-type: application/json

{
    "_data": [
        {
            "_type": "dataset/sql/Country",
            "_id": "abdd1245-bbf9-4085-9366-f11c0f737c1d",
            "_rev": "16dabe62-61e9-4549-a6bd-07cecfbc3508",
            "_txn": "792a5029-63c9-4c07-995c-cbc063aaac2c",
            "name": "Vilnius"
        }
    ]
}

$ http :8000/dataset/sql/Country/abdd1245-bbf9-4085-9366-f11c0f737c1d
HTTP/1.1 200 OK
content-type: application/json

{
    "_type": "dataset/sql/Country",
    "_id": "abdd1245-bbf9-4085-9366-f11c0f737c1d",
    "_rev": "16dabe62-61e9-4549-a6bd-07cecfbc3508",
    "_txn": "792a5029-63c9-4c07-995c-cbc063aaac2c",
    "name": "Vilnius"
}

$ http :8000/dataset/sql/Country/abdd1245-bbf9-4085-9366-f11c0f737c1d?select(name)
HTTP/1.1 200 OK
content-type: application/json

{
    "name": "Vilnius"
}

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.79

Nov 12, 2024

0.1.78

Oct 22, 2024

0.1.77

Oct 22, 2024

0.1.76

Oct 8, 2024

0.1.75

Sep 24, 2024

0.1.74

Sep 24, 2024

0.1.73

Sep 19, 2024

0.1.72

Sep 18, 2024

0.1.71

Sep 12, 2024

This version

0.1.70

Aug 27, 2024

0.1.69

Aug 23, 2024

0.1.68

Aug 23, 2024

0.1.67

Aug 2, 2024

0.1.66

Jul 23, 2024

0.1.65

Jul 3, 2024

0.1.64

Jul 2, 2024

0.1.63

Jun 27, 2024

0.1.62

Feb 29, 2024

0.1.61

Jan 31, 2024

0.1.60

Nov 21, 2023

0.1.59

Nov 14, 2023

0.1.58

Oct 31, 2023

0.1.57

Oct 24, 2023

0.1.56

Sep 30, 2023

0.1.55

Aug 18, 2023

0.1.54

Aug 2, 2023

0.1.53

Aug 1, 2023

0.1.52

Jun 21, 2023

0.1.51

Jun 20, 2023

0.1.50

May 22, 2023

0.1.49

Apr 19, 2023

0.1.48

Apr 14, 2023

0.1.47

Mar 27, 2023

0.1.46

Mar 21, 2023

0.1.45

Mar 20, 2023

0.1.44

Nov 23, 2022

0.1.43

Nov 15, 2022

0.1.42

Nov 8, 2022

0.1.41

Nov 8, 2022

0.1.40

Nov 1, 2022

0.1.39

Oct 12, 2022

0.1.38

Oct 3, 2022

0.1.37

Oct 2, 2022

0.1.36

Jul 25, 2022

0.1.35

May 16, 2022

0.1.34

Apr 22, 2022

0.1.33

Apr 22, 2022

0.1.32

Apr 20, 2022

0.1.31

Apr 20, 2022

0.1.30

Apr 19, 2022

0.1.29

Apr 12, 2022

0.1.28

Mar 17, 2022

0.1.27

Mar 2, 2022

0.1.26

Feb 9, 2022

0.1.25

Feb 8, 2022

0.1.24

Jan 25, 2022

0.1.23

Nov 18, 2021

0.1.22

Nov 11, 2021

0.1.21

Oct 6, 2021

0.1.20

Sep 23, 2021

0.1.19

Aug 5, 2021

0.1.18

Jul 30, 2021

0.1.17

Jul 29, 2021

0.1.16

Jul 23, 2021

0.1.15.dev0 pre-release

Jul 1, 2021

0.1.14

Apr 15, 2021

0.1.13

Apr 1, 2021

0.1.13.dev0 pre-release

Apr 1, 2021

0.1.12

Mar 4, 2021

0.1.11

Mar 4, 2021

0.1.10

Mar 1, 2021

0.1.9

Feb 1, 2021

0.1.8

Jan 28, 2021

0.1.7

Jan 28, 2021

0.1.6

Sep 11, 2020

0.1.5

Jan 8, 2020

0.1.4

Oct 26, 2019

0.1.3

Oct 25, 2019

0.1.2

Oct 25, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spinta-0.1.70.tar.gz (374.7 kB view details)

Uploaded Aug 27, 2024 Source

Built Distribution

spinta-0.1.70-py3-none-any.whl (535.8 kB view details)

Uploaded Aug 27, 2024 Python 3

File details

Details for the file spinta-0.1.70.tar.gz.

File metadata

Download URL: spinta-0.1.70.tar.gz
Upload date: Aug 27, 2024
Size: 374.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.4.2 CPython/3.10.6 Linux/5.15.146.1-microsoft-standard-WSL2

File hashes

Hashes for spinta-0.1.70.tar.gz
Algorithm	Hash digest
SHA256	`d16e2eead45d0e099cc7b02145dcc918c87fbd8c0b3067554b3791fd890e222b`
MD5	`e3a5764dc032ccf67c69e52b50b51e9c`
BLAKE2b-256	`4f9bf1f52da434797d638cb7558238b7b299184eca5a73be21491b03a030dcce`

See more details on using hashes here.

File details

Details for the file spinta-0.1.70-py3-none-any.whl.

File metadata

Download URL: spinta-0.1.70-py3-none-any.whl
Upload date: Aug 27, 2024
Size: 535.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.4.2 CPython/3.10.6 Linux/5.15.146.1-microsoft-standard-WSL2

File hashes

Hashes for spinta-0.1.70-py3-none-any.whl
Algorithm	Hash digest
SHA256	`aac66ebdd01af74594bef014fd596f373de3987a93b0694e0a1d77914c0d7df8`
MD5	`adc5d059fc27c8cd4d35703e42f68ff0`
BLAKE2b-256	`8e4f9a1fa7e078481d76b036e6304a4814f9e9ef57f979cc22694a67bad040ec`

See more details on using hashes here.

spinta 0.1.70

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Purpose

Features

Example

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes