A utility library for working with Table Schema in Python

These details have not been verified by PyPI

Project links

Homepage

Project description

tableschema-py

A library for working with Table Schema in Python.

Features

Table to work with data tables described by Table Schema
Schema representing Table Schema
Field representing Table Schema field
validate to validate Table Schema
infer to infer Table Schema from data
built-in command-line interface to validate and infer schemas
storage/plugins system to connect tables to different storage backends like SQL Database

Gettings Started
- Installation
- Examples
Documentation
- Table
- Schema
- Field
- validate
- infer
- Exceptions
- Storage
- Plugins
- CLI
Contributing
Changelog

Gettings Started

Installation

The package use semantic versioning. It means that major versions could include breaking changes. It's highly recommended to specify tableschema version range in your setup/requirements file e.g. tableschema>=1.0,<2.0.

$ pip install tableschema

Examples

Code examples in this readme requires Python 3.4+ interpreter. You could see even more example in examples directory.

from tableschema import Table

# Create table
table = Table('path.csv', schema='schema.json')

# Print schema descriptor
print(table.schema.descriptor)

# Print cast rows in a dict form
for keyed_row in table.iter(keyed=True):
    print(keyed_row)

Documentation

Table

A table is a core concept in a tabular data world. It represents data with metadata (Table Schema). Let's see how we can use it in practice.

Consider we have some local csv file. It could be inline data or from a remote link - all supported by the Table class (except local files for in-brower usage of course). But say it's data.csv for now:

city,location
london,"51.50,-0.11"
paris,"48.85,2.30"
rome,N/A

Let's create and read a table instance. We use the static Table.load method and the table.read method with the keyed option to get an array of keyed rows:

table = Table('data.csv')
table.headers # ['city', 'location']
table.read(keyed=True)
# [
#   {city: 'london', location: '51.50,-0.11'},
#   {city: 'paris', location: '48.85,2.30'},
#   {city: 'rome', location: 'N/A'},
# ]

As we can see, our locations are just strings. But they should be geopoints. Also, Rome's location is not available, but it's just a string N/A instead of None. First we have to infer Table Schema:

table.infer()
table.schema.descriptor
# { fields:
#   [ { name: 'city', type: 'string', format: 'default' },
#     { name: 'location', type: 'geopoint', format: 'default' } ],
#  missingValues: [ '' ] }
table.read(keyed=True)
# Fails with a data validation error

Let's fix the "not available" location. There is a missingValues property in Table Schema specification. As a first try we set missingValues to N/A in table.schema.descriptor. The schema descriptor can be changed in-place, but all changes should also be committed using table.schema.commit():

table.schema.descriptor['missingValues'] = 'N/A'
table.schema.commit()
table.schema.valid # false
table.schema.errors
# [<ValidationError: "'N/A' is not of type 'array'">]

As a good citizens we've decided to check our schema descriptor's validity. And it's not valid! We should use an array for the missingValues property. Also, don't forget to include "empty string" as a valid missing value:

table.schema.descriptor['missingValues'] = ['', 'N/A']
table.schema.commit()
table.schema.valid # true

All good. It looks like we're ready to read our data again:

table.read(keyed=True)
# [
#   {city: 'london', location: [51.50,-0.11]},
#   {city: 'paris', location: [48.85,2.30]},
#   {city: 'rome', location: null},
# ]

Now we see that:

locations are arrays with numeric latitude and longitude
Rome's location is a native Python None

And because there are no errors after reading, we can be sure that our data is valid against our schema. Let's save it:

table.schema.save('schema.json')
table.save('data.csv')

Our data.csv looks the same because it has been stringified back to csv format. But now we have schema.json:

{
    "fields": [
        {
            "name": "city",
            "type": "string",
            "format": "default"
        },
        {
            "name": "location",
            "type": "geopoint",
            "format": "default"
        }
    ],
    "missingValues": [
        "",
        "N/A"
    ]
}

If we decide to improve it even more we could update the schema file and then open it again. But now providing a schema path:

table = Table('data.csv', schema='schema.json')
# Continue the work

This is a basic introduction to the Table class. To learn more let's take a look at the Table class API reference.

`Table(source, schema=None, strict=False, post_cast=[], storage=None, **options)`

Constructor to instantiate Table class. If references argument is provided, foreign keys will be checked on any reading operation.

source (str/list[]) - data source (one of):
- local file (path)
- remote file (url)
- array of arrays representing the rows
schema (any) - data schema in all forms supported by Schema class
strict (bool) - strictness option to pass to Schema constructor
post_cast (function[]) - list of post cast processors
storage (None/str) - storage name like sql or bigquery
options (dict) - tabulator or storage options
(exceptions.TableSchemaException) - raises any error that occurs in table creation process
(Table) - returns data table class instance

`table.headers`

(str[]) - returns data source headers

`table.schema`

(Schema) - returns schema class instance

`table.iter(keyed=Fase, extended=False, cast=True, relations=False)`

Iterates through the table data and emits rows cast based on table schema. Data casting can be disabled.

keyed (bool) - iterate keyed rows
extended (bool) - iterate extended rows
cast (bool) - disable data casting if false
relations (dict) - dictionary of foreign key references in a form of {resource1: [{field1: value1, field2: value2}, ...], ...}. If provided, foreign key fields will checked and resolved to their references
(exceptions.TableSchemaException) - raises any error that occurs during this process
(any[]/any{}) - yields rows:
- [value1, value2] - base
- {header1: value1, header2: value2} - keyed
- [rowNumber, [header1, header2], [value1, value2]] - extended

`table.read(keyed=False, extended=False, cast=True, relations=False, limit=None)`

Read the whole table and returns as array of rows. Count of rows could be limited.

keyed (bool) - flag to emit keyed rows
extended (bool) - flag to emit extended rows
cast (bool) - flag to disable data casting if false
relations (dict) - dict of foreign key references in a form of {resource1: [{field1: value1, field2: value2}, ...], ...}. If provided foreign key fields will checked and resolved to its references
limit (int) - integer limit of rows to return
(exceptions.TableSchemaException) - raises any error that occurs during this process
(list[]) - returns array of rows (see table.iter)

`table.infer(limit=100, confidence=0.75)`

Infer a schema for the table. It will infer and set Table Schema to table.schema based on table data.

limit (int) - limit rows sample size
confidence (float) - how many casting errors are allowed (as a ratio, between 0 and 1)
(dict) - returns Table Schema descriptor

`table.save(target, storage=None, **options)`

To save schema use table.schema.save()

Save data source to file locally in CSV format with , (comma) delimiter

target (str) - saving target (e.g. file path)
storage (None/str) - storage name like sql or bigquery
options (dict) - tabulator or storage options
(exceptions.TableSchemaException) - raises an error if there is saving problem
(True/Storage) - returns true or storage instance

Schema

A model of a schema with helpful methods for working with the schema and supported data. Schema instances can be initialized with a schema source as a url to a JSON file or a JSON object. The schema is initially validated (see validate below). By default validation errors will be stored in schema.errors but in a strict mode it will be instantly raised.

Let's create a blank schema. It's not valid because descriptor.fields property is required by the Table Schema specification:

schema = Schema()
schema.valid # false
schema.errors
# [<ValidationError: "'fields' is a required property">]

To avoid creating a schema descriptor by hand we will use a schema.infer method to infer the descriptor from given data:

schema.infer([
  ['id', 'age', 'name'],
  ['1','39','Paul'],
  ['2','23','Jimmy'],
  ['3','36','Jane'],
  ['4','28','Judy'],
])
schema.valid # true
schema.descriptor
#{ fields:
#   [ { name: 'id', type: 'integer', format: 'default' },
#     { name: 'age', type: 'integer', format: 'default' },
#     { name: 'name', type: 'string', format: 'default' } ],
#  missingValues: [ '' ] }

Now we have an inferred schema and it's valid. We can cast data rows against our schema. We provide a string input which will be cast correspondingly:

schema.cast_row(['5', '66', 'Sam'])
# [ 5, 66, 'Sam' ]

But if we try provide some missing value to the age field, the cast will fail because the only valid "missing" value is an empty string. Let's update our schema:

schema.cast_row(['6', 'N/A', 'Walt'])
# Cast error
schema.descriptor['missingValues'] = ['', 'N/A']
schema.commit()
schema.cast_row(['6', 'N/A', 'Walt'])
# [ 6, None, 'Walt' ]

We can save the schema to a local file, and resume work on it at any time by loading it from that file:

schema.save('schema.json')
schema = Schema('schema.json')

This was a basic introduction to the Schema class. To learn more, let's take a look at the Schema API reference.

`Schema(descriptor, strict=False)`

Constructor to instantiate Schema class.

descriptor (str/dict) - schema descriptor:
- local path
- remote url
- dictionary
strict (bool) - flag to specify validation behaviour:
- if false, errors will not be raised but instead collected in schema.errors
- if true, validation errors are raised immediately
(exceptions.TableSchemaException) - raise any error that occurs during the process
(Schema) - returns schema class instance

`schema.valid`

(bool) - returns validation status. Always true in strict mode.

`schema.errors`

(Exception[]) - returns validation errors. Always empty in strict mode.

`schema.descriptor`

(dict) - returns schema descriptor

`schema.primary_key`

(str[]) - returns schema primary key

`schema.foreign_keys`

(dict[]) - returns schema foreign keys

`schema.fields`

(Field[]) - returns an array of Field instances

`schema.field_names`

(str[]) - returns an array of field names.

`schema.get_field(name)`

Get schema field by name.

Note: use update_field if you want to modify the field descriptor

name (str) - schema field name
(Field/None) - returns Field instance or None if not found

`schema.add_field(descriptor)`

Add new field to schema. The schema descriptor will be validated with newly added field descriptor.

descriptor (dict) - field descriptor
(exceptions.TableSchemaException) - raises any error that occurs during the process
(Field/None) - returns added Field instance or None if not added

`schema.update_field(name, update)`

Update existing descriptor field by name

name (str) - schema field name
update (dict) - update to apply to field's descriptor
(bool) - returns true on success and false if no field is found to be modified

cf schema.commit() example

`schema.remove_field(name)`

Remove field resource by name. The schema descriptor will be validated after field descriptor removal.

name (str) - schema field name
(exceptions.TableSchemaException) - raises any error that occurs during the process
(Field/None) - returns removed Field instances or None if not found

`schema.cast_row(row)`

Cast row based on field types and formats.

row (any[]) - data row as an array of values
(any[]) - returns cast data row

`schema.infer(rows, headers=1, confidence=0.75, guesser_cls=None, resolver_cls=None)`

Infer and set schema.descriptor based on data sample.

rows (list[]) - array of arrays representing rows.
headers (int/str[]) - data sample headers (one of):
- row number containing headers (rows should contain headers rows)
- array of headers (rows should NOT contain headers rows)
confidence (float) - how many casting errors are allowed (as a ratio, between 0 and 1)
guesser_cls & resolver_cls - you can implement inferring strategies by providing type-guessing and type-resolving classes [experimental]
{dict} - returns Table Schema descriptor

`schema.commit(strict=None)`

Update schema instance if there are in-place changes in the descriptor.

strict (bool) - alter strict mode for further work
(exceptions.TableSchemaException) - raises any error that occurs during the process
(bool) - returns true on success and false if not modified

from tableschema import Schema
descriptor = {'fields': [{'name': 'my_field', 'title': 'My Field', 'type': 'string'}]}
schema = Schema(descriptor)
print(schema.get_field('my_field').descriptor['type']) # string

# Update descriptor by field position
schema.descriptor['fields'][0]['type'] = 'number'
# Update descriptor by field name
schema.update_field('my_field', {'title': 'My Pretty Field'}) # True

# Change are not committed
print(schema.get_field('my_field').descriptor['type']) # string
print(schema.get_field('my_field').descriptor['title']) # My Field


# Commit change
schema.commit()
print(schema.get_field('my_field').descriptor['type']) # number
print(schema.get_field('my_field').descriptor['title']) # My Pretty Field

`schema.save(target)`

Save schema descriptor to target destination.

target (str) - path where to save a descriptor
(exceptions.TableSchemaException) - raises any error that occurs during the process
(bool) - returns true on success

Field

from tableschema import Field

# Init field
field = Field({'name': 'name', 'type': 'number'})

# Cast a value
field.cast_value('12345') # -> 12345

Data values can be cast to native Python objects with a Field instance. Type instances can be initialized with field descriptors. This allows formats and constraints to be defined.

Casting a value will check the value is of the expected type, is in the correct format, and complies with any constraints imposed by a schema. E.g. a date value (in ISO 8601 format) can be cast with a DateType instance. Values that can't be cast will raise an InvalidCastError exception.

Casting a value that doesn't meet the constraints will raise a ConstraintError exception.

Here is an API reference for the Field class:

`new Field(descriptor, missingValues=[''])`

Constructor to instantiate Field class.

descriptor (dict) - schema field descriptor
missingValues (str[]) - an array with string representing missing values
(exceptions.TableSchemaException) - raises any error that occurs during the process
(Field) - returns field class instance

`field.schema`

(Schema) - returns a schema instance if the field belongs to some schema

`field.name`

(str) - returns field name

`field.type`

(str) - returns field type

`field.format`

(str) - returns field format

`field.required`

(bool) - returns true if field is required

`field.constraints`

(dict) - returns an object with field constraints

`field.descriptor`

(dict) - returns field descriptor

`field.castValue(value, constraints=true)`

Cast given value according to the field type and format.

value (any) - value to cast against field
constraints (boll/str[]) - gets constraints configuration
- it could be set to true to disable constraint checks
- it could be an Array of constraints to check e.g. ['minimum', 'maximum']
(exceptions.TableSchemaException) - raises any error that occurs during the process
(any) - returns cast value

`field.testValue(value, constraints=true)`

Test if value is compliant to the field.

value (any) - value to cast against field
constraints (bool/str[]) - constraints configuration
(bool) - returns if value is compliant to the field

validate

Given a schema as JSON file, url to JSON file, or a Python dict, validate returns true for a valid Table Schema, or raises an exception, exceptions.ValidationError. It validates only schema, not data against schema!

from tableschema import validate, exceptions

try:
    valid = validate(descriptor)
except exceptions.ValidationError as exception:
   for error in exception.errors:
       # handle individual error

`validate(descriptor)`

Validate a Table Schema descriptor.

descriptor (str/dict) - schema descriptor (one of):
- local path
- remote url
- object
(exceptions.ValidationError) - raises on invalid
(bool) - returns true on valid

infer

Given headers and data, infer will return a Table Schema as a Python dict based on the data values. Given the data file, data_to_infer.csv:

id,age,name
1,39,Paul
2,23,Jimmy
3,36,Jane
4,28,Judy

Let's call infer for this file:

from tableschema import infer

descriptor = infer('data_to_infer.csv')
#{'fields': [
#    {
#        'format': 'default',
#        'name': 'id',
#        'type': 'integer'
#    },
#    {
#        'format': 'default',
#        'name': 'age',
#        'type': 'integer'
#    },
#    {
#        'format': 'default',
#        'name': 'name',
#        'type': 'string'
#    }]
#}

The number of rows used by infer can be limited with the limit argument.

`infer(source, headers=1, limit=100, confidence=0.75, **options)`

Infer source schema.

source (any) - source as path, url or inline data
headers (int/str[]) - headers rows number or headers list
confidence (float) - how many casting errors are allowed (as a ratio, between 0 and 1)
(exceptions.TableSchemaException) - raises any error that occurs during the process
(dict) - returns schema descriptor

Exceptions

`exceptions.TableSchemaException`

Base class for all library exceptions. If there are multiple errors, they can be read from the exception object:

try:
    # lib action
except exceptions.TableSchemaException as exception:
    if exception.multiple:
        for error in exception.errors:
            # handle error

`exceptions.LoadError`

All loading errors.

`exceptions.ValidationError`

All validation errors.

`exceptions.CastError`

All value cast errors.

`exceptions.RelationError`

All integrity errors.

`exceptions.StorageError`

All storage errors.

Storage

The library includes interface declaration to implement tabular Storage. This interface allow to use different data storage systems like SQL with tableschema.Table class (load/save) as well as on the data package level:

Storage

For instantiation of concrete storage instances, tableschema.Storage provides a unified factory method connect (which uses the plugin system under the hood):

# pip install tableschema_sql
from tableschema import Storage

storage = Storage.connect('sql', **options)
storage.create('bucket', descriptor)
storage.write('bucket', rows)
storage.read('bucket')

`Storage.connect(name, **options)`

Create tabular storage based on storage name.

name (str) - storage name like sql
options (dict) - concrete storage options
(exceptions.StorageError) - raises on any error
(Storage) - returns Storage instance

An implementor should follow tableschema.Storage interface to write his own storage backend. Concrete storage backends could include additional functionality specific to conrete storage system. See plugins below to know how to integrate custom storage plugin into your workflow.

`<<Interface>>Storage(**options)`

Create tabular storage. Implementations should fully implement this interface to be compatible with the Storage API.

options (dict) - concrete storage options
(exceptions.StorageError) - raises on any error
(Storage) - returns Storage instance

`storage.buckets`

Return list of storage bucket names. A bucket is a special term which has almost the same meaning as table. You should consider bucket as a table stored in the storage.

(exceptions.StorageError) - raises on any error
str[] - return list of bucket names

`create(bucket, descriptor, force=False)`

Create one/multiple buckets.

bucket (str/list) - bucket name or list of bucket names
descriptor (dict/dict[]) - schema descriptor or list of descriptors
force (bool) - whether to delete and re-create already existing buckets
(exceptions.StorageError) - raises on any error

`delete(bucket=None, ignore=False)`

Delete one/multiple/all buckets.

bucket (str/list/None) - bucket name or list of bucket names to delete. If None, all buckets will be deleted
descriptor (dict/dict[]) - schema descriptor or list of descriptors
ignore (bool) - don't raise an error on non-existent bucket deletion from storage
(exceptions.StorageError) - raises on any error

`describe(bucket, descriptor=None)`

Get/set bucket's Table Schema descriptor.

bucket (str) - bucket name
descriptor (dict/None) - schema descriptor to set
(exceptions.StorageError) - raises on any error
(dict) - returns Table Schema descriptor

`iter(bucket)`

This method should return an iterator of typed values based on the schema of this bucket.

bucket (str) - bucket name
(exceptions.StorageError) - raises on any error
(list[]) - yields data rows

`read(bucket)`

This method should read typed values based on the schema of this bucket.

bucket (str) - bucket name
(exceptions.StorageError) - raises on any error
(list[]) - returns data rows

`write(bucket, rows)`

This method writes data rows into storage. It should store values of unsupported types as strings internally (like csv does).

bucket (str) - bucket name
rows (list[]) - data rows to write
(exceptions.StorageError) - raises on any error

Plugins

Table Schema has a plugin system. Any package with the name like tableschema_<name> can be imported as:

from tableschema.plugins import <name>

If a plugin is not installed, an ImportError will be raised with a message describing how to install the plugin.

Official plugins

CLI

It's a provisional API excluded from SemVer. If you use it as a part of another program please pin tableschema to a concrete version in your requirements file.

Table Schema features a CLI called tableschema. This CLI exposes the infer and validate functions for command line use.

Example of validate usage:

$ tableschema validate path/to-schema.json

Example of infer usage:

$ tableschema infer path/to/data.csv

The response is a schema as JSON. The optional argument --encoding allows a character encoding to be specified for the data file. The default is utf-8.

Contributing

The project follows the Open Knowledge International coding standards.

The recommended way to get started is to create and activate a project virtual environment. To install package and development dependencies into your active environment:

$ make install

To run tests with linting and coverage:

$ make test

For linting, pylama (configured in pylama.ini) is used. At this stage it's already installed into your environment and could be used separately with more fine-grained control as described in documentation - https://pylama.readthedocs.io/en/latest/.

For example to sort results by error type:

$ pylama --sort <path>

For testing, tox (configured in tox.ini) is used. It's already installed into your environment and could be used separately with more fine-grained control as described in documentation - https://testrun.org/tox/latest/.

For example to check subset of tests against Python 2 environment with increased verbosity. All positional arguments and options after -- will be passed to py.test:

tox -e py27 -- -v tests/<path>

Under the hood tox uses pytest (configured in pytest.ini), coverage and mock packages. These packages are available only in tox envionments.

Changelog

Here described only breaking and the most important changes. The full changelog and documentation for all released versions can be found in the nicely formatted commit history.

v1.7

Added field.schema property

v1.6

In strict mode raise an exception if there are problems in field construction

v1.5

Allow providing custom guesser and resolver to schema infer

v1.4

Added schema.update_field method

v1.3

Support datetime with no time for date casting

v1.2

Support floats like 1.0 for integer casting

v1.1

Added the confidence parameter to infer

v1.0

The library has been rebased on the Frictionless Data specs v1 - https://frictionlessdata.io/specs/table-schema/

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.20.11

Apr 1, 2024

1.20.10

Mar 22, 2024

1.20.9

Mar 13, 2024

1.20.7

Mar 13, 2024

1.20.6

Mar 12, 2024

1.20.5

Mar 12, 2024

1.20.4

Mar 12, 2024

1.20.3

Mar 12, 2024

1.20.2

Feb 24, 2021

1.20.1

Feb 24, 2021

1.20.0

Oct 6, 2020

1.19.5

Sep 26, 2020

1.19.4

Sep 12, 2020

1.19.3

Aug 15, 2020

1.19.2

Jun 3, 2020

1.18.0

May 20, 2020

1.17.2

May 18, 2020

1.17.0

Apr 29, 2020

1.16.4

Apr 27, 2020

1.16.2

Apr 24, 2020

1.16.1

Apr 24, 2020

1.16.0

Apr 23, 2020

1.15.3

Mar 26, 2020

1.15.2

Mar 26, 2020

1.15.0

Mar 3, 2020

1.14.0

Mar 2, 2020

1.13.1

Feb 19, 2020

1.13.0

Feb 18, 2020

1.12.5

Feb 10, 2020

1.12.4

Feb 5, 2020

1.12.3

Jan 13, 2020

1.12.2

Dec 17, 2019

1.12.1

Dec 15, 2019

1.12.0

Dec 10, 2019

1.11.0

Nov 25, 2019

1.10.0

Oct 31, 2019

1.9.0

Oct 31, 2019

1.8.0

Oct 9, 2019

This version

1.7.4

Sep 27, 2019

1.7.3

Sep 26, 2019

1.7.2

Sep 18, 2019

1.7.1

Sep 18, 2019

1.7.0

Sep 3, 2019

1.6.0

Jul 8, 2019

1.5.4

Jun 28, 2019

1.5.3

Jun 23, 2019

1.5.2

Jun 10, 2019

1.5.1

Jun 6, 2019

1.5.0

May 23, 2019

1.4.1

Apr 17, 2019

1.4.0

Apr 11, 2019

1.3.3

Mar 25, 2019

1.3.2

Mar 25, 2019

1.3.1

Mar 25, 2019

1.3.0

Nov 26, 2018

1.2.5

Oct 18, 2018

1.2.4

Oct 9, 2018

1.2.3

Oct 8, 2018

1.2.2

Sep 19, 2018

1.2.1

Sep 12, 2018

1.2.0

Aug 16, 2018

1.1.0

May 29, 2018

1.0.13

Apr 12, 2018

1.0.12

Feb 20, 2018

1.0.11

Dec 20, 2017

1.0.10

Nov 20, 2017

1.0.9

Nov 20, 2017

1.0.8

Oct 1, 2017

1.0.7

Sep 30, 2017

1.0.6

Sep 30, 2017

1.0.5

Sep 30, 2017

1.0.4

Sep 27, 2017

1.0.3

Sep 20, 2017

1.0.2

Sep 18, 2017

1.0.1

Sep 7, 2017

1.0.0

Sep 4, 2017

1.0.0a14 pre-release

Aug 31, 2017

1.0.0a13 pre-release

Aug 29, 2017

1.0.0a12 pre-release

Aug 22, 2017

1.0.0a11 pre-release

Aug 22, 2017

1.0.0a10 pre-release

Aug 22, 2017

1.0.0a9 pre-release

Aug 19, 2017

1.0.0a8 pre-release

Jul 27, 2017

1.0.0a7 pre-release

Jun 9, 2017

1.0.0a5 pre-release

May 25, 2017

1.0.0a4 pre-release

Apr 11, 2017

1.0.0a3 pre-release

Apr 5, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tableschema-1.7.4.tar.gz (4.8 MB view hashes)

Uploaded Sep 27, 2019 Source

Built Distribution

tableschema-1.7.4-py2.py3-none-any.whl (57.0 kB view hashes)

Uploaded Sep 27, 2019 Python 2 Python 3

Hashes for tableschema-1.7.4.tar.gz

Hashes for tableschema-1.7.4.tar.gz
Algorithm	Hash digest
SHA256	`f3f38c1143f881e3d34b96439859fd00e5688dc6fd0dc292c41d5812f140b669`
MD5	`6a08a4efa388f5603193bd715ef015ce`
BLAKE2b-256	`a1c46c71ba6e59a9f59cd89b2e20913a7656686c465817f1135070fc124171af`

Hashes for tableschema-1.7.4-py2.py3-none-any.whl

Hashes for tableschema-1.7.4-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`6edea82839bfef161eacdbadca0a2c96a3b376070fecf5ebd81c020806e77104`
MD5	`36e51d4e29680ac5b9113507e1a1df0b`
BLAKE2b-256	`893db5a4045193069e85263008e3bb08b376fc4fbeec10ab533a9a4897b70260`

tableschema 1.7.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

tableschema-py

Features

Contents

Gettings Started

Installation

Examples

Documentation

Table

Table(source, schema=None, strict=False, post_cast=[], storage=None, **options)

table.headers

table.schema

table.iter(keyed=Fase, extended=False, cast=True, relations=False)

table.read(keyed=False, extended=False, cast=True, relations=False, limit=None)

table.infer(limit=100, confidence=0.75)

table.save(target, storage=None, **options)

Schema

Schema(descriptor, strict=False)

schema.valid

schema.errors

schema.descriptor

schema.primary_key

schema.foreign_keys

schema.fields

schema.field_names

schema.get_field(name)

schema.add_field(descriptor)

schema.update_field(name, update)

schema.remove_field(name)

schema.cast_row(row)

schema.infer(rows, headers=1, confidence=0.75, guesser_cls=None, resolver_cls=None)

schema.commit(strict=None)

schema.save(target)

Field

new Field(descriptor, missingValues=[''])

field.schema

field.name

field.type

field.format

field.required

field.constraints

field.descriptor

field.castValue(value, constraints=true)

field.testValue(value, constraints=true)

validate

validate(descriptor)

infer

infer(source, headers=1, limit=100, confidence=0.75, **options)

Exceptions

exceptions.TableSchemaException

exceptions.LoadError

exceptions.ValidationError

exceptions.CastError

exceptions.RelationError

exceptions.StorageError

Storage

Storage.connect(name, **options)

<<Interface>>Storage(**options)

storage.buckets

create(bucket, descriptor, force=False)

delete(bucket=None, ignore=False)

describe(bucket, descriptor=None)

iter(bucket)

read(bucket)

write(bucket, rows)

Plugins

Official plugins

CLI

Contributing

Changelog

v1.7

v1.6

`Table(source, schema=None, strict=False, post_cast=[], storage=None, **options)`

`table.headers`

`table.schema`

`table.iter(keyed=Fase, extended=False, cast=True, relations=False)`

`table.read(keyed=False, extended=False, cast=True, relations=False, limit=None)`

`table.infer(limit=100, confidence=0.75)`

`table.save(target, storage=None, **options)`

`Schema(descriptor, strict=False)`

`schema.valid`

`schema.errors`

`schema.descriptor`

`schema.primary_key`

`schema.foreign_keys`

`schema.fields`

`schema.field_names`

`schema.get_field(name)`

`schema.add_field(descriptor)`

`schema.update_field(name, update)`

`schema.remove_field(name)`

`schema.cast_row(row)`

`schema.infer(rows, headers=1, confidence=0.75, guesser_cls=None, resolver_cls=None)`

`schema.commit(strict=None)`

`schema.save(target)`

`new Field(descriptor, missingValues=[''])`

`field.schema`

`field.name`

`field.type`

`field.format`

`field.required`

`field.constraints`

`field.descriptor`

`field.castValue(value, constraints=true)`

`field.testValue(value, constraints=true)`

`validate(descriptor)`

`infer(source, headers=1, limit=100, confidence=0.75, **options)`

`exceptions.TableSchemaException`

`exceptions.LoadError`

`exceptions.ValidationError`

`exceptions.CastError`

`exceptions.RelationError`

`exceptions.StorageError`

`Storage.connect(name, **options)`

`<<Interface>>Storage(**options)`

`storage.buckets`

`create(bucket, descriptor, force=False)`

`delete(bucket=None, ignore=False)`

`describe(bucket, descriptor=None)`

`iter(bucket)`

`read(bucket)`

`write(bucket, rows)`