Tools to work with Amsterdam Schema.

These details have not been verified by PyPI

Project links

Homepage

Project description

amsterdam-schema-tools

Set of libraries and tools to work with Amsterdam schema.

Install the package with: pip install amsterdam-schema-tools. This installs the library and a command-line tool called schema, with various subcommands. A listing can be obtained from schema --help.

Subcommands that talk to a PostgreSQL database expect either a DATABASE_URL environment variable or a command line option --db-url with a DSN.

Many subcommands want to know where to find schema files. Most will look in a directory of schemas denoted by the SCHEMA_URL environment variable or the --schema-url command line option. E.g.,

schema create tables --schema-url=myschemas mydataset

will try to load the schema for mydataset from myschemas/mydataset/dataset.json.

Generate amsterdam schema from existing database tables

The --prefix argument controls whether table prefixes are removed in the schema, because that is required for Django models.

As example we can generate a BAG schema. Point DATABASE_URL to bag_v11 database and then run :

schema show tablenames | sort | awk '/^bag_/{print}' | xargs schema introspect db bag --prefix bag_ | jq

The jq formats it nicely and it can be redirected to the correct directory in the schemas repository directly.

Express amsterdam schema information in relational tables

Amsterdam schema is expressed as jsonschema. However, to make it easier for people with a more relational mind- or toolset it is possible to express amsterdam schema as a set of relational tables. These tables are meta_dataset, meta_table and meta_field.

It is possible to convert a jsonschema into the relational table structure and vice-versa.

This command converts a dataset from an existing dataset in jsonschema format:

schema import schema <id of dataset>

To convert from relational tables back to jsonschema:

schema show schema <id of dataset>

Generating amsterdam schema from existing GeoJSON files

The following command can be used to inspect and import the GeoJSON files:

schema introspect geojson <dataset-id> *.geojson > schema.json
edit schema.json  # fine-tune the table names
schema import geojson schema.json <table1> file1.geojson
schema import geojson schema.json <table2> file2.geojson

Importing GOB events

The schematools library has a module that reads GOB events into database tables that are defines by an Amsterdam schema. This module can be used to read GOB events from a Kafka stream. It is also possible to read GOB events from a batch file with line-separeted events using:

schema import events <path-to-dataset> <path-to-file-with-events>

Export datasets

Datasets can be exported to different file formats. Currently supported are geopackage, csv and jsonlines. The command for exporting the dataset tables is:

schema export [geopackage|csv|jsonlines] <id of dataset>

The command has several command-line options that can be used. Documentations about these flags can be shown using the --help options.

Schema Tools as a pre-commit hook

Included in the project is a pre-commit hook that can validate schema files in a project such as amsterdam-schema

To configure it extend the .pre-commit-config.yaml in the project with the schema file defintions as follows:

  - repo: https://github.com/Amsterdam/schema-tools
    rev: v3.5.0
    hooks:
      - id: validate-schema
        args: ['https://schemas.data.amsterdam.nl/schema@v1.2.0#']
        exclude: |
            (?x)^(
                schema.+|             # exclude meta schemas
                datasets/index.json
            )$

args is a one element list containing the URL to the Amsterdam Meta Schema.

validate-schema will only process json files. However not all json files are Amsterdam schema files. To exclude files or directories use exclude with pattern.

pre-commit depends on properly tagged revisions of its hooks. Hence, we should not only bump version numbers on updates to this package, but also commit a tag with the version number; see below.

Doing a release

(This is for schema-tools developers.)

We use GitHub pull requests. If your PR should produce a new release of schema-tools, make sure one of the commit increments the version number in setup.cfg appropriately. Then,

merge the commit in GitHub, after review;
pull the code from GitHub and merge it into the master branch, git checkout master && git fetch origin && git merge --ff-only origin/master;
tag the release X.Y.Z with git tag -a vX.Y.Z -m "Bump to vX.Y.Z";
push the tag to GitHub with git push origin --tags;
release to PyPI: make upload (requires the PyPI secret).

Mocking data

The schematools library contains two Django management commands to generate mock data. The first one is create_mock_data which generates mock data for all the datasets that are found at the configured schema location SCHEMA_URL (where SCHEMA_URL can be configure to point to a path at the local filesystem).

The create_mock_data command processes all datasets. However, it is possible to limit this by adding positional arguments. These positional arguments can be dataset ids or paths to the location of the dataset.json on the local filesystem.

Furthermore, the command has some options, e.g. to change the default number of generated records (--size) or to reverse meaning of the positional arguments using --exclude.

To avoid duplicate primary keys on subsequent runs the --start-at options can be used to start autonumbering of primary keys at an offset.

E.g. to generate 5 records for the bag and gebieden datasets, starting the autonumbering of primary keys at 50.

    django create_mock_data bag gebieden --size 5 --start-at 50

To generate records for all datasets, except for the fietspaaltjes dataset:

    django create_mock_data fietspaaltjes --exclude  # or -x

To generate records for the bbga dataset, by loading the schema from the local filesystem:

    django create_mock_data <path-to-bbga-schema>/datasets.json

During record generation in create_mock_data, the relations are not added, so foreign key fields will be filled with NULL values.

There is a second management command relate_mock_data that can be used to add the relations. This command support positional arguments for datasets in the same way as create_mock_data.
Furthermore, the command also has the --exclude option to reverse the meaning of the positional dataset arguments.

E.g. to add relations to all datasets:

    django relate_mock_data

To add relations for bag and gebieden only:

    django relate_mock_data bag gebieden

To add relations for all datasets except meetbouten:

    django relate_mock_data meetbouten --exclude  # or -x

NB. When only a subset of the datasets is being mocked, the command can fail when datasets that are involved in a relation are missing, so make sure to include all relevant datasets.

For convenience an additional management command truncate_tables has been added, to truncate all tables.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

6.1.2

Nov 13, 2024

6.1.1

Oct 2, 2024

6.1

Oct 1, 2024

6.0.1

Jul 15, 2024

6.0

Jul 11, 2024

5.26.1

Mar 5, 2024

5.26.0

Feb 29, 2024

5.25.0

Feb 8, 2024

5.24.1

Feb 7, 2024

5.24.0

Feb 6, 2024

5.23.4

Jan 31, 2024

5.23.3

Jan 25, 2024

5.23.2

Jan 23, 2024

5.23.1

Jan 23, 2024

5.23.0

Jan 12, 2024

5.22.0

Jan 10, 2024

5.21.2

Jan 8, 2024

5.21.0

Dec 21, 2023

5.20.1

Dec 19, 2023

5.20.0

Dec 15, 2023

5.19.1

Dec 6, 2023

5.17.18

Nov 27, 2023

5.17.17

Oct 25, 2023

5.17.16

Oct 23, 2023

5.17.15

Oct 19, 2023

5.17.14

Oct 18, 2023

This version

5.17.13

Oct 11, 2023

5.17.12

Oct 9, 2023

5.17.11

Oct 6, 2023

5.17.10

Oct 5, 2023

5.17.9

Oct 5, 2023

5.17.8

Oct 4, 2023

5.17.7

Sep 26, 2023

5.17.6

Sep 25, 2023

5.17.5

Sep 25, 2023

5.17.4

Sep 25, 2023

5.17.3

Sep 21, 2023

5.17.2

Sep 19, 2023

5.17.1

Sep 18, 2023

5.17.0

Sep 14, 2023

5.16.1

Sep 14, 2023

5.16.0

Sep 8, 2023

5.15.1

Sep 7, 2023

5.15.0

Sep 6, 2023

5.14.2

Sep 6, 2023

5.14.1

Aug 30, 2023

5.14.0

Aug 28, 2023

5.13.4

Aug 23, 2023

5.13.3

Aug 21, 2023

5.13.2

Jul 24, 2023

5.13.1

Jul 17, 2023

5.13.0

Jul 14, 2023

5.12.5

Jul 3, 2023

5.12.3

Jun 12, 2023

5.12.2

Jun 8, 2023

5.12.1

Jun 8, 2023

5.12.0

Jun 8, 2023

5.11.6

Jun 6, 2023

5.11.5

May 30, 2023

5.11.4

May 25, 2023

5.11.3

May 24, 2023

5.11.2

May 23, 2023

5.11.1

May 17, 2023

5.11.0

May 17, 2023

5.10.2

May 10, 2023

5.10.1

May 8, 2023

5.10.0

May 4, 2023

5.9.3

Apr 20, 2023

5.9.2

Apr 13, 2023

5.9.1

Apr 7, 2023

5.9.0

Apr 6, 2023

5.8.6

Apr 5, 2023

5.8.5

Apr 4, 2023

5.8.4

Apr 3, 2023

5.8.3

Mar 30, 2023

5.8.2

Mar 28, 2023

5.8.1

Mar 23, 2023

5.8.0

Mar 23, 2023

5.7.0

Mar 20, 2023

5.6.12

Mar 8, 2023

5.6.11

Feb 27, 2023

5.6.10

Feb 22, 2023

5.6.9

Feb 21, 2023

5.6.8

Feb 14, 2023

5.6.7

Feb 7, 2023

5.6.6

Feb 7, 2023

5.6.5

Feb 1, 2023

5.6.4

Jan 30, 2023

5.6.3

Jan 30, 2023

5.6.2

Jan 25, 2023

5.6.1

Jan 24, 2023

5.6.0

Jan 23, 2023

5.5.2

Jan 17, 2023

5.5.1

Jan 16, 2023

5.5.0

Jan 16, 2023

5.4.0

Jan 11, 2023

5.3.0

Dec 21, 2022

5.2.0

Dec 20, 2022

5.1.6

Dec 19, 2022

5.1.5

Dec 14, 2022

5.1.4

Dec 13, 2022

5.1.3

Dec 1, 2022

5.1.2

Nov 24, 2022

5.1.1

Nov 22, 2022

5.1

Nov 21, 2022

5.0.2

Nov 15, 2022

5.0.1

Nov 2, 2022

5.0

Oct 31, 2022

4.3.0

Oct 13, 2022

4.2.2

Sep 13, 2022

4.2.1

Aug 23, 2022

4.2.0

Aug 22, 2022

4.1.3

Aug 9, 2022

4.1.2

Aug 9, 2022

4.1.1

Aug 4, 2022

4.1.0

Jul 28, 2022

4.0.0

Jul 28, 2022

3.6.11

Jul 25, 2022

3.6.10

Jul 19, 2022

3.6.9

Jul 13, 2022

3.6.8

Jun 24, 2022

3.6.7

Jun 23, 2022

3.6.6

Jun 15, 2022

3.6.5

Jun 2, 2022

3.6.4

Jun 2, 2022

3.6.3

May 17, 2022

3.6.2

May 16, 2022

3.6.0

May 16, 2022

3.5.3

May 3, 2022

3.5.2

Apr 12, 2022

3.4.2

Mar 23, 2022

3.4.1

Feb 24, 2022

3.4.0

Feb 14, 2022

3.3.7

Feb 1, 2022

3.3.6

Feb 1, 2022

3.3.5

Jan 26, 2022

3.3.4

Jan 20, 2022

3.3.3

Jan 20, 2022

3.3.2

Jan 20, 2022

3.3.1

Jan 19, 2022

3.3.0

Jan 19, 2022

3.2.0

Jan 18, 2022

3.1.4

Dec 7, 2021

3.1.3

Nov 30, 2021

3.1.2

Nov 29, 2021

3.1.0

Nov 28, 2021

3.0.4

Nov 22, 2021

3.0.3

Nov 18, 2021

3.0.2

Nov 17, 2021

3.0.1

Nov 16, 2021

3.0.0

Nov 15, 2021

2.3.2

Nov 11, 2021

2.3.1

Nov 3, 2021

2.3.0

Nov 3, 2021

2.2.1

Nov 1, 2021

2.2.0

Oct 25, 2021

2.1.3

Oct 21, 2021

2.1.2

Oct 21, 2021

2.1.1

Oct 21, 2021

1.0.5

Oct 19, 2021

1.0.4

Oct 11, 2021

1.0.3

Oct 6, 2021

1.0.2

Sep 28, 2021

1.0.1

Sep 16, 2021

1.0.0

Sep 8, 2021

0.23.6

Sep 8, 2021

0.23.5

Sep 8, 2021

0.23.4

Aug 19, 2021

0.23.3

Aug 15, 2021

0.23.2

Aug 5, 2021

0.23.1

Jul 27, 2021

0.23.0

Jul 26, 2021

0.22.2

Jul 15, 2021

0.22.1

Jul 14, 2021

0.22.0

Jul 12, 2021

0.21.13

Jul 8, 2021

0.21.12

Jul 8, 2021

0.21.11

Jul 7, 2021

0.21.10

Jul 6, 2021

0.21.9

Jul 6, 2021

0.21.8

Jul 5, 2021

0.21.7

Jun 23, 2021

0.21.6

Jun 23, 2021

0.21.5

Jun 23, 2021

0.21.4

Jun 16, 2021

0.21.3

May 27, 2021

0.21.1

May 27, 2021

0.21.0

May 19, 2021

0.20.6

May 19, 2021

0.20.5

May 11, 2021

0.20.4

Apr 29, 2021

0.20.3

Apr 26, 2021

0.20.2

Apr 6, 2021

0.20.1

Apr 1, 2021

0.20.0

Apr 1, 2021

0.19.0

Mar 31, 2021

0.18.2

Mar 24, 2021

0.18.1

Mar 23, 2021

0.18.0

Mar 17, 2021

0.17.10

Mar 15, 2021

0.17.9

Mar 11, 2021

0.17.8

Mar 9, 2021

0.17.7

Mar 9, 2021

0.17.6

Mar 9, 2021

0.17.5

Feb 25, 2021

0.17.4

Feb 17, 2021

0.17.3

Feb 9, 2021

0.17.2

Feb 1, 2021

0.17.1

Jan 28, 2021

0.17.0

Jan 27, 2021

0.17a1 pre-release

Jan 29, 2021

0.16.6

Jan 13, 2021

0.16.5

Jan 11, 2021

0.16.4

Dec 23, 2020

0.16.3

Dec 23, 2020

0.16.2

Dec 23, 2020

0.16.1

Dec 23, 2020

0.16.0

Dec 10, 2020

0.15.11

Dec 10, 2020

0.15.10

Dec 9, 2020

0.15.9

Dec 8, 2020

0.15.8

Dec 7, 2020

0.15.7

Dec 1, 2020

0.15.6

Nov 26, 2020

0.15.5

Nov 23, 2020

0.15.4

Nov 23, 2020

0.15.3

Nov 23, 2020

0.15.2

Nov 23, 2020

0.15.1

Nov 19, 2020

0.15.0

Nov 5, 2020

0.14.5

Oct 22, 2020

0.14.4

Oct 21, 2020

0.14.3

Oct 13, 2020

0.14.2

Oct 13, 2020

0.14.1

Oct 8, 2020

0.13.2

Oct 5, 2020

0.13.1

Oct 5, 2020

0.12.5

Sep 29, 2020

0.12.4

Sep 28, 2020

0.12.3

Sep 28, 2020

0.12.2

Sep 24, 2020

0.12.1

Sep 24, 2020

0.12.0

Sep 23, 2020

0.11.0

Sep 21, 2020

0.10.2

Sep 15, 2020

0.10.1

Sep 14, 2020

0.10.0

Sep 14, 2020

0.9.10

Aug 11, 2020

0.9.9

Aug 10, 2020

0.9.8

Jul 21, 2020

0.9.7

Jul 14, 2020

0.9.6

Jul 14, 2020

0.9.5

Jul 14, 2020

0.9.4

Jul 9, 2020

0.9.3

Jul 8, 2020

0.9.2

Jul 6, 2020

0.9.1

Jul 2, 2020

0.9.0

Jul 2, 2020

0.8.13

Jun 30, 2020

0.8.12

Jun 25, 2020

0.8.11

Jun 24, 2020

0.8.10

Jun 23, 2020

0.8.9

Jun 22, 2020

0.8.8

Jun 22, 2020

0.8.7

Jun 19, 2020

0.8.5

Jun 18, 2020

0.8.4

Jun 15, 2020

0.8.3

Jun 10, 2020

0.8.2

May 27, 2020

0.8.1

May 27, 2020

0.8.0

May 25, 2020

0.7.1

May 20, 2020

0.7.0

May 20, 2020

0.6.0

May 13, 2020

0.5.1

May 11, 2020

0.5.0

May 7, 2020

0.4.2

May 4, 2020

0.4.1

Apr 23, 2020

0.4.0

Apr 23, 2020

0.0.4

Apr 20, 2020

0.0.3

Mar 11, 2020

0.0.2

Feb 4, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

amsterdam-schema-tools-5.17.13.tar.gz (165.0 kB view details)

Uploaded Oct 11, 2023 Source

Built Distribution

amsterdam_schema_tools-5.17.13-py3-none-any.whl (170.6 kB view details)

Uploaded Oct 11, 2023 Python 3

File details

Details for the file amsterdam-schema-tools-5.17.13.tar.gz.

File metadata

Download URL: amsterdam-schema-tools-5.17.13.tar.gz
Upload date: Oct 11, 2023
Size: 165.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for amsterdam-schema-tools-5.17.13.tar.gz
Algorithm	Hash digest
SHA256	`32536a5cac5789ee5c1b2c0ea0f2e69f44134618653c945b30618414861580b8`
MD5	`de1a5a8548eddc73ba4ea6463aad8f7e`
BLAKE2b-256	`71dbdd7657c566d59bc72ba2f6a2a743cebc6088c9b14f8a71291de6e913ca5b`

See more details on using hashes here.

File details

Details for the file amsterdam_schema_tools-5.17.13-py3-none-any.whl.

File metadata

Download URL: amsterdam_schema_tools-5.17.13-py3-none-any.whl
Upload date: Oct 11, 2023
Size: 170.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for amsterdam_schema_tools-5.17.13-py3-none-any.whl
Algorithm	Hash digest
SHA256	`452a2f80e9227b97a104f3a84989ae9bda33d08aa7ce4de027ef966763615c86`
MD5	`f156d9ea8b28548cc750048788457c5b`
BLAKE2b-256	`afbdc3b7381d1eca5189578454d82ceb367b1e37c390f1a55ebf059ac80cdc4c`